Description

BACKGROUND OF THE INVENTION 
[...] bacteria and fragments thereof, polypeptides [...] recording media in which the nucle[...]

[...] using the polynucleotide and/or polypeptide sequence information to make comparisons.
2. Brief Description of the Background Art

[...] mutants thereof are known. [...] Gram-positive bacterium identified as a glutamic acid-pro[...]

[...] lysine, for example, a microorganism belonging to the [...] activity of a biosynthesis enzyme [...] excessive production by concerted inhibition by lysine and threonine [...]. The biosyn[...] common to lysine, threonine and methionine, i.e., [...] so as not to bios[...]

[...] substances so as to improve the productivity [...] molecular biological data on coryneform bacteria [...]

[0005] However, [...] have been obtained [...]

[...] regulating the growth and metabolism [...] ATCC 13032 is reported and it is known that [...] the nucleotide sequences of most genes have not [...] microorganisms, such as Escherichia [...]

[0007] In recent years, the full nucleotide sequence of the genome of se[...]

[...] number of genes have been presumed, without genetic, biochemical or molecular biological experiments.

[0008] In recent years, moreover, [...] monitoring expression levels of a great number of genes simultaneously or detecting mutations, using DNA chips, DNA arrays or the [...] a partial nucleic acid fragment of a [...]

[...] 96: 12833-38 (1999); Science, 284: 1520-23 (1999)).
SUMMARY OF THE INVENTION

[0009] An object of the present invention is to provide a polynucleotide and a polypeptide derived from a microorganism of coryneform bacteria which are industrially useful, sequence information of the polynucleotide and the polypeptide, a method for analyzing the microorganism, an apparatus and a system for use in the analysis, and a method for breeding the microorganism.

[0010] The present invention provides a polynucleotide and an oligonucleotide derived from a microorganism belonging to coryneform bacteria, oligonucleotide arrays to which the polynucleotides and the oligonucleotides are fixed, a polypeptide encoded by the polynucleotide, an antibody which recognizes the polypeptide, polypeptide arrays to which the polypeptides or the antibodies are fixed, a computer readable recording medium in which the nucleotide sequences of the polynucleotide and the oligonucleotide and the amino acid sequence of the polypeptide have been recorded, and a system based on the computer using the recording medium as well as a method of using the polynucleotide and/or polypeptide sequence information to make comparisons.

BRIEF DESCRIPTION OF THE DRAWINGS

[0011] Fig. 1 is a map showing the positions of typical genes on the genome of Corynebacterium glutamicum ATCC 13032.

[0012] Fig. 2 is an electrophoresis image showing the results of proteome analyses using proteins derived from (A) Corynebacterium glutamicum ATCC 13032, (B) FERM BP-7134, and (C) FERM BP-158.

[0013] Fig. 3 is a flow chart of an example of a system using the computer readable media according to the present 
invention. 

[0014] Fig. 4 is a flow chart of an example of a system using the computer readable media according to the present 
invention.


DETAILED DESCRIPTION OF THE INVENTION 

[0015] This application is based on Japanese applications No. Hei. 11-377484 filed on December 16, 1999, No. 2000-159162 filed on April 7, 2000 and No. 2000-280988 filed on August 3, 2000, the entire contents of which are incorporated hereinto by reference.

[0016] From the viewpoint that the determination of the full nucleotide sequence of Corynebacterium glutamicum would make it possible to specify gene regions which had not been previously identified, to determine the function of an unknown gene derived from the microorganism through comparison with nucleotide sequences of known genes and amino acid sequences of known genes, and to obtain a useful mutant based on the presumption of the metabolic regulatory mechanism of a useful product by the microorganism, the inventors conducted intensive studies and, as a result, found that the complete genome sequence of Corynebacterium glutamicum can be determined by applying the whole genome shotgun method.

[0017] Specifically, the present invention relates to the following (1) to (65):

(1) A method for at least one of the following:

(A) identifying a mutation point of a gene derived from a mutant of a coryneform bacterium,

(B) measuring an expression amount of a gene derived from a coryneform bacterium,

(C) analyzing an expression profile of a gene derived from a coryneform bacterium,

(D) analyzing expression patterns of genes derived from a coryneform bacterium, or

(E) identifying a gene homologous to a gene derived from a coryneform bacterium,

said method comprising:

(a) producing a polynucleotide array by adhering to a solid support at least two polynucleotides selected from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any one of SEQ ID NOS:1 to 3501, second polynucleotides which hybridize with the first polynucleotides under stringent conditions, and third polynucleotides comprising a sequence of 10 to 200 continuous bases of the first or second polynucleotides,

(b) incubating the polynucleotide array with at least one of a labeled polynucleotide derived from a coryneform bacterium, a labeled polynucleotide derived from a mutant of the coryneform bacterium or a labeled polynucleotide to be examined, under hybridization conditions,

(c) detecting any hybridization, and

(d) analyzing the result of the hybridization.
As used herein, for example, the at least two polynucleotides can be at least two of the first polynucleotides, at least two of the second polynucleotides, at least two of the third polynucleotides, or at least two of the first, second and third polynucleotides.

(2) The method according to (1), wherein the coryneform bacterium is a microorganism belonging to the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium.

(3) The method according to (2), wherein the microorganism belonging to the genus Corynebacterium is selected from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

(4) The method according to (1), wherein the polynucleotide derived from a coryneform bacterium, the polynucleotide derived from a mutant of the coryneform bacterium or the polynucleotide to be examined [...] to the biosynthesis of at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof.

(5) The method according to (1), wherein the polynucleotide to be examined is derived from Escherichia coli.

(6) A polynucleotide array, comprising:

at least two polynucleotides selected from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any one of SEQ ID NOS:1 to 3501, second polynucleotides which hybridize with the first polynucleotides under stringent conditions, and third polynucleotides comprising 10 to 200 continuous bases of the first or second polynucleotides, and
a solid support adhered thereto.

As used herein, for example, the at least two polynucleotides can be at least two of the first polynucleotides, at least two of the second polynucleotides, at least two of the third polynucleotides, or at least two of the first, second and third polynucleotides.

(7) [...] the nucleotide [...]

(8) [...] a polynucleotide which hybridizes with the polynucleotide under stringent conditions.

(9) A polynucleotide encoding a polypeptide having any one of the amino acid sequences represented by SEQ ID NOS:3502 to 6931, or a polynucleotide which hybridizes therewith under stringent conditions.

(10) A polynucleotide which is present in the 5' upstream or 3' downstream of a [...] nucleotide sequence of any one of SEQ ID NOS:2 to 3431 in a whole polynucleotide [...] sequence represented by SEQ ID NO:1, and has an activity of regulating an expression of the polynucleotide.

(11) A polynucleotide comprising 10 to 200 continuous bases in the nucleotide sequence of the polynucleotide of any one of (7) to (10), or a polynucleotide comprising a nucleotide sequence complementary to the polynucleotide comprising 10 to 200 continuous bases.

(12) A recombinant DNA comprising the polynucleotide of any one of (8) to (11).

(13) A transformant comprising the polynucleotide of any one of (8) to (11) or the recombinant DNA of (12).

(14) A method for producing a polypeptide, comprising:

culturing the transformant of (13) in a medium to produce and accumulate a polypeptide encoded by the polynucleotide of (8) or (9) in the medium, and
recovering the polypeptide from the medium.

(15) A method for producing at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, comprising:

culturing the transformant of (13) in a medium to produce and accumulate at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof in the medium, and
recovering the at least one of the amino acid, the nucleic acid, the vitamin, the saccharide, the organic acid, and analogues thereof from the medium.

(16) A polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from SEQ ID NOS:2 to 3431.

(17) A polypeptide comprising the amino acid sequence selected from SEQ ID NOS:3502 to 6931.

(18) The polypeptide according to (16) or (17), wherein at least one amino acid is deleted, replaced, inserted or added, said polypeptide having an activity which is substantially the same as that of the polypeptide without said at least one amino acid deletion, replacement, insertion or addition.

(19) A polypeptide comprising an amino acid sequence having a homology of at least 60% with the amino acid sequence of the polypeptide of (16) or (17), and having an activity which is substantially the same as that of the polypeptide.

(20) An antibody which recognizes the polypeptide of any one of (16) to (19). 

(21) A polypeptide array, comprising: 

at least one polypeptide or partial fragment polypeptide selected from the polypeptides of (16) to (19) and 
partial fragment polypeptides of the polypeptides, and 
a solid support adhered thereto. 

(22) A polypeptide array, comprising: 

at least one antibody which recognizes a polypeptide or partial fragment polypeptide selected from the polypeptides of (16) to (19) and partial fragment polypeptides of the polypeptides, and
a solid support adhered thereto. 

(23) A system based on a computer for identifying a target sequence or a target structure motif derived from a coryneform bacterium, comprising the following:

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501, and target sequence or target structure motif information;

(ii) a data storage device for at least temporarily storing the input information;

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501 with the target sequence or target structure motif information, recorded by the data storage device for screening and analyzing nucleotide sequence information which is coincident with or analogous to the target sequence or target structure motif information; and

(iv) an output device that shows a screening or analyzing result obtained by the comparator.

(24) A method based on a computer for identifying a target sequence or a target structure motif derived from a coryneform bacterium, comprising the following:

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501, target sequence information or target structure motif information into a user input device;

(ii) at least temporarily storing said information;

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501 with the target sequence or target structure motif information; and

(iv) screening and analyzing nucleotide sequence information which is coincident with or analogous to the target sequence or target structure motif information.

(25) A system based on a computer for identifying a target sequence or a target structure motif derived from a coryneform bacterium, comprising the following:

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001, and target sequence or target structure motif information;

(ii) a data storage device for at least temporarily storing the input information;

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 with the target sequence or target structure motif information, recorded by the data storage device for screening and analyzing amino acid sequence information which is coincident with or analogous to the target sequence or target structure motif information; and

(iv) an output device that shows a screening or analyzing result obtained by the comparator.

(26) A method based on a computer for identifying a target sequence or a target structure motif derived from a coryneform bacterium, comprising the following:

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001, and target sequence information or target structure motif information into a user input device;

[...] target sequence or target structure motif information, [...]

(27) [...] sequence information;

[...] nucleotide having at least one nucleotide sequence selected from SEQ ID NOS:2 to 3501; and

(iv) an output device that shows a function obtained by the comparator.

(28) [...] following:

[...] the target nucleotide sequence information; and [...] having the target nucleotide sequence [...] from SEQ ID NOS:2 to 3501.

(29) A system based on a computer for determining a function of a polypeptide having [...] derived from a coryneform bacterium, comprising the following:

(i) a user input device that inputs at least [...] 3502 to 7001, function information based on the amino acid sequence, and target ami[...]

[...] one amino acid sequence selected from SEQ ID NOS:3502 to [...]; and

(iv) an output device that shows a function obtained by the comparator.

(30) A method based on a computer for determining a function of a polypeptide having a target amino acid sequence derived from a coryneform bacterium, comprising the following:

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001, function information based on the amino acid sequence, and target amino acid sequence information;

(ii) at least temporarily storing said information;

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 with the target amino acid sequence [...] amino acid sequence which is coincident with or [...] 7001.

(31) The system according to any one of (23), (25), (27) and (29), wherein a coryneform bacterium is a microorganism of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium.

(32) The method according to any one of (24), (26), (28) and (30), wherein a coryneform bacterium is a microorganism of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium.

(33) The system according to (31), wherein the microorganism belonging to the genus Corynebacterium is selected from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

(34) The method according to (32), wherein the microorganism belonging to the genus Corynebacterium is selected from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

(35) A recording medium or storage device which is readable by a computer in which at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501 or function information based on the nucleotide sequence is recorded, and is usable in the system of (23) or (27) or the method of (24) or (28).

(36) A recording medium or storage device which is readable by a computer in which at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid sequence is recorded, and is usable in the system of (25) or (29) or the method of (26) or (30).

(37) The recording medium or storage device according to (35) or (36), which is a computer readable recording medium selected from the group consisting of a floppy disc, a hard disc, a magnetic tape, a random access memory (RAM), a read only memory (ROM), a magneto-optic disc (MO), CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM and DVD-RW.

(38) A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the Val residue at the 59th position in the amino acid sequence of homoserine dehydrogenase derived from a coryneform bacterium is replaced with an amino acid residue other than a Val residue.

(39) A polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino acid sequence represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue.

(40) The polypeptide according to (38) or (39), wherein the Val residue at the 59th position is replaced with an Ala residue.

(41) A polypeptide having pyruvate carboxylase activity, comprising an amino acid sequence in which the Pro residue at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform bacterium is replaced with an amino acid residue other than a Pro residue.

(42) A polypeptide comprising an amino acid sequence in which the Pro residue at the 458th position in the amino acid sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue.

(43) The polypeptide according to (41) or (42), wherein the Pro residue at the 458th position is replaced with a Ser residue.

(44) The polypeptide according to any one of (38) to (43), which is derived from Corynebacterium glutamicum.

(45) A DNA encoding the polypeptide of any one of (38) to (44). 

(46) A recombinant DNA comprising the DNA of (45). 

(47) A transformant comprising the recombinant DNA of (46).

(48) A transformant comprising in its chromosome the DNA of (45).

(49) The transformant according to (47) or (48), which is derived from a coryneform bacterium.

(50) The transformant according to (49), which is derived from Corynebacterium glutamicum.

(51) A method for producing L-lysine, comprising:

culturing the transformant of any one of (47) to (50) in a medium to produce and accumulate L-lysine in the medium, and

recovering the L-lysine from the culture.

(52) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ ID NOS:1 to 3431, comprising the following:

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived from a coryneform bacterium which has been subjected to mutation breeding so as to produce at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof by a fermentation method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431;

(ii) identifying a mutation point present in the production strain based on a result obtained by (i);

(iii) introducing the mutation point into a coryneform bacterium which is free of the mutation point; and

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform bacterium obtained in (iii).

(53) The method according to (52), wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or [...]

(54) The method according to (52), wherein the mutation point is a mutation point relating to a useful mutation which improves or stabilizes the productivity.

(55) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ ID NOS:1 to 3431, comprising:

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived from a coryneform bacterium which has been subjected to mutation breeding so as to produce at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof by a fermentation method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431;

(ii) identifying a mutation point present in the production strain based on a result obtained by (i);

(iii) deleting a mutation point from a coryneform bacterium having the mutation point; and

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform bacterium obtained in (iii).

(56) The method according to (55), wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or a signal transmission pathway.

(57) The method according to (55), wherein the mutation point is a mutation point which decreases or destabilizes [...]

(58) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ ID NOS:2 to 3431, comprising the following:

(i) identifying an isozyme relating to biosynthesis of at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, based on the nucleotide sequence information represented by SEQ ID NOS:2 to 3431;

(ii) classifying the isozyme identified in (i) into an isozyme having the same activity;

(iii) mutating all genes encoding the isozyme having the same activity simultaneously; and

(iv) examining productivity by a fermentation method of the compound selected in (i) of the coryneform bacterium which has been transformed with the gene obtained in (iii).

(59) A method for breeding a coryneform bacterium using the nucleotide sequence information represented by 
SEQ ID NOS:2 to 3431 , comprising the following: 



(i) arranging function information of an open reading frame (ORF) represented by SEQ ID NOS:2 to 3431;

(ii) allowing the arranged ORF to correspond to an enzyme on a known biosynthesis or signal transmission pathway;

(iii) explicating an unknown biosynthesis pathway or signal transmission pathway of a coryneform bacterium in combination with information relating to a known biosynthesis pathway or signal transmission pathway of a coryneform bacterium;

(iv) comparing the pathway explicated in (iii) with a biosynthesis pathway of a target useful product; and

(v) transgenetically varying a coryneform bacterium based on the nucleotide sequence information to either strengthen a pathway which is judged to be important in the biosynthesis of the target useful product in (iv) or weaken a pathway which is judged not to be important in the biosynthesis of the target useful product in (iv).

(60) A coryneform bacterium, bred by the method of any one of (52) to (59). 

(61) The coryneform bacterium according to (60), which is a microorganism belonging to the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium.

(62) The coryneform bacterium according to (61), wherein the microorganism belonging to the genus Corynebacterium is selected from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

(63) A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid and an analogue thereof, comprising:

culturing a coryneform bacterium of any one of (60) to (62) in a medium to produce and accumulate at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof; and

recovering the compound from the culture.

(64) The method according to (63), wherein the compound is L-lysine.

(65) A method for identifying a protein relating to useful mutation based on proteome analysis, comprising the following:

(i) preparing

a protein derived from a bacterium of a production strain of a coryneform bacterium which has been subjected to mutation breeding by a fermentation process so as to produce at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, and
a protein derived from a bacterium of a parent strain of the production strain;

(ii) separating the proteins prepared in (i) by two dimensional electrophoresis;

(iii) detecting the separated proteins, and comparing an expression amount of the protein derived from the production strain with that derived from the parent strain;

(iv) treating the protein showing different expression amounts as a result of the comparison with a peptidase to extract peptide fragments;

(v) analyzing amino acid sequences of the peptide fragments obtained in (iv); and

(vi) comparing the amino acid sequences obtained in (v) with the amino acid sequence represented by SEQ ID NOS:3502 to 7001 to identify the protein having the amino acid sequences.

As used herein, the term "proteome", a word coined by combining "protein" with "genome", refers to a method for examining a gene at the polypeptide level.

(66) The method according to (65), wherein the coryneform bacterium is a microorganism belonging to the genus 
Corynebacterium, the genus Brevibacterium, or the genus Microbacterium. 

(67) The method according to (66), wherein the microorganism belonging to the genus Corynebacterium is selected from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

(68) A biologically pure culture of Corynebacterium glutamicum AHP-3 (FERM BP-7382).
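
Items (23), (24) and (52) above describe comparing stored sequence information against a target sequence and locating mutation points in a production strain. The following is a minimal Python sketch of both comparisons, offered only as an illustration and not as the system described in this specification. The function names `screen_sequences` and `mutation_points`, the dictionary of stored sequences, and the use of exact substring matching are assumptions made for the example; finding "analogous" sequences, or comparing sequences that are not already aligned, would require a proper alignment tool.

```python
from typing import Dict, List, Tuple


def screen_sequences(stored: Dict[str, str], target: str) -> List[Tuple[str, int]]:
    """Return (sequence id, 0-based offset) for each exact occurrence of target.

    `stored` is a hypothetical mapping from a sequence identifier to its
    nucleotide (or amino acid) sequence; matching is case-insensitive.
    """
    target = target.upper()
    hits: List[Tuple[str, int]] = []
    for seq_id, seq in stored.items():
        seq = seq.upper()
        pos = seq.find(target)
        while pos != -1:
            hits.append((seq_id, pos))
            pos = seq.find(target, pos + 1)  # allow overlapping matches
    return hits


def mutation_points(reference: str, production: str) -> List[Tuple[int, str, str]]:
    """List (1-based position, reference base, production base) per substitution.

    Assumes the two sequences are already aligned and of equal length, so only
    substitutions are reported (no insertions or deletions).
    """
    if len(reference) != len(production):
        raise ValueError("sequences must be pre-aligned to equal length")
    return [
        (i + 1, r, p)
        for i, (r, p) in enumerate(zip(reference.upper(), production.upper()))
        if r != p
    ]
```

In this sketch the stored dictionary plays the role of the data storage device and the two functions play the role of the comparator; the returned lists are what an output device would display.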

[0018] The present invention will be described below in more detail, based on the determination of the full nucleotide sequence of coryneform bacteria.

1. Determination of full nucleotide sequence of coryneform bacteria

[0019] The term "coryneform bacteria" as used herein means a microorganism belonging to the genus Corynebacterium, the genus Brevibacterium or the genus Microbacterium as defined in Bergey's Manual of Determinative Bacteriology, 8 (1974).

[0020] Examples include Corynebacterium acetoacidophilum, Corynebacterium acetoglutamicum, Corynebacterium callunae, Corynebacterium glutamicum, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium melassecola, Corynebacterium thermoaminogenes, Brevibacterium saccharolyticum, Brevibacterium immariophilum, Brevibacterium roseum, Brevibacterium thiogenitalis, Microbacterium ammoniaphilum, and the like.

[0021] Specific examples include Corynebacterium acetoacidophilum ATCC 13870, Corynebacterium acetoglutamicum ATCC 15806, Corynebacterium callunae ATCC 15991, Corynebacterium glutamicum ATCC 13032, Corynebacterium glutamicum ATCC 13060, Corynebacterium glutamicum ATCC 13826 (prior genus and species: Brevibacterium flavum, or Corynebacterium lactofermentum), Corynebacterium glutamicum ATCC 14020 (prior genus and species: Brevibacterium divaricatum), Corynebacterium glutamicum ATCC 13869 (prior genus and species: Brevibacterium lactofermentum), Corynebacterium herculis ATCC 13868, Corynebacterium lilium ATCC 15990, Corynebacterium melassecola ATCC 17966, Corynebacterium thermoaminogenes FERM 9244, Brevibacterium saccharolyticum ATCC 14066, Brevibacterium immariophilum ATCC 14068, Brevibacterium roseum ATCC 13825, Brevibacterium thiogenitalis ATCC 19240, Microbacterium ammoniaphilum ATCC 15354, and the like.
(1) Preparation of genome DNA of coryneform bacteria

[0022] Coryneform bacteria can be cultured by a conventional method.

[0023] Any of a natural medium and a synthetic medium can be used, so long as it is a medium suitable for efficient culturing of the microorganism, and it contains a carbon source, a nitrogen source, an inorganic salt, and the like [...].

[0024] [...] for example, a BY medium (7 g/l meat extract, 10 g/l peptone, 3 g/l sodium chloride, 5 g/l yeast extract, pH 7.2) containing 1% of glycine and the like can be used. The culturing is carried out at [...].

[0025] After the completion of the culture, the cells are recovered from the culture by centrifugation. The resulting [...]

[...] genome DNA [...] lysing the cell wall of the cells using a lysozyme and a surfactant (SDS, etc.), [...] and the like using a phenol solution and a phenol/chloroform solution, and then precipitating the genome DNA with ethanol or the like. Specifically, the following method can be illustrated.

[0027] The washed cells are suspended in a washing solution containing 5 to 20 mg/l lysozyme. After shaking, 5 to [...] is added to lyse the cells [...]. Shaking is gently performed at [...].

[...] shaking, the suspension is maintained at 60 to 70°C for 5 to 15 minutes for the lysis.

[0029] After the lysis, the suspension is cooled to ordinary temperature, and 5 to 20 ml of Tris-neutralized phenol is added thereto, followed by gently shaking at room temperature for 15 to 45 minutes.

[0030] After shaking, centrifugation (15,000 x g, 20 minutes, 20°C) is carried out to fractionate the aqueous layer.

[0031] After performing extraction with phenol/chloroform and extraction with chloroform (twice) in the same manner, 3 mol/l sodium acetate solution (pH 5.2) and isopropanol are added to the aqueous layer at 1/10 times volume and 2 times volume of the aqueous layer, respectively, followed by gently stirring to precipitate the genome DNA.

[0032] The genome DNA is dissolved again in a buffer containing 0.01 to 0.04 [...]. As an example of the buffer, TE buffer (10 mmol/l Tris hydrochloride, 1 mmol/l EDTA, pH 8.0) can be used. After dissolving, the resultant solution is maintained at 25 to 40°C for 20 to 50 minutes and then extracted successively with phenol, phenol/chloroform [...].

[...] precipitation is carried out and the resulting DNA precipitate is washed with 70% ethanol, followed by air drying, and then dissolved in TE buffer to obtain a genome DNA solution.
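
The reagent volumes in the precipitation step scale with the volume of the recovered aqueous layer. As a rough illustration, assuming the ratios read as 1/10 volume of 3 mol/l sodium acetate solution and 2 volumes of isopropanol, the arithmetic can be sketched in Python. The function name `precipitation_volumes` and the dictionary keys are hypothetical and chosen only for this example.

```python
def precipitation_volumes(aqueous_ml: float) -> dict:
    """Return reagent volumes (ml) for precipitating DNA from an aqueous layer.

    Assumed ratios: sodium acetate solution at 1/10 of the aqueous volume and
    isopropanol at 2 times the aqueous volume.
    """
    if aqueous_ml <= 0:
        raise ValueError("aqueous layer volume must be positive")
    return {
        "sodium_acetate_ml": aqueous_ml / 10,
        "isopropanol_ml": aqueous_ml * 2,
    }
```

For a 5 ml aqueous layer this gives 0.5 ml of sodium acetate solution and 10 ml of isopropanol.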

(2) Production of shotgun library 

[0034] A method for producing a genome DNA library using the genome DNA of the coryneform bacteria prepared in the above (1) [...] a method described in Molecular Cloning [...] (hereinafter referred to as "Molecular Cloning, 2nd ed."). In particular, the following [...] DNA library appropriately usable in determining the full nucleotide sequence by the shotgun method.
[0035] To [...] mg of the genome DNA of the coryneform bacteria prepared in the above (1), a buffer, such as TE buffer or the like, is added to give a total volume of 0.4 ml. Then, the genome DNA is digested into fragments of 1 to [...] with a sonicator (Yamato Powersonic Model 50). The treatment with the sonicator is performed at an output of [...]

[0036] [...]

[0037] The blunt-ended genome fragments are fractionated by agarose gel or polyacrylamide gel electrophoresis and genome fragments of 1 to 2 kb are cut out from the gel.

[0038] To the gel, [...] to 0.5 ml of a buffer for eluting DNA, such as MG elution buffer (0.5 mol/l ammonium acetate, 10 mmol/l magnesium acetate, 1 mmol/l EDTA, 0.1% SDS) or the like, is added, followed by shaking at 25 to 40°C [...]

[0039] The resulting DNA eluate is treated with phenol/chloroform and then precipitated with ethanol to obtain a [...]

[0040] [...] is ligated into a suitable vector, such as pUC18 [...] or the like, using T4 ligase (manufactured by Takara Shuzo) or the like. The ligation can be carried out by allowing a mixture to stand at 10 to 20°C for 20 to 50 hours.

[0041] The resulting ligation product is precipitated with ethanol and dissolved in 5 to 20 µl of TE buffer.

[0042] Escherichia coli is transformed in accordance with a conventional method [...] solution. Examples of the transformation method include the electroporation method using ELECTRO MAX DH10B (manufactured by Life Technologies) for Escherichia coli. The electroporation method can be carried out under the conditions as described in the manufacturer's instructions.

[0043] The transformed Escherichia coli is spread on a suitable selection medium containing agar, for example, LB plate medium containing 10 to 100 mg/l ampicillin (LB medium (10 g/l bactotrypton, 5 g/l yeast extract, 10 g/l sodium chloride, pH 7.0) containing 1.6% of agar) when pUC18 is used as the cloning vector, and cultured therein.

[0044] The transformant can be obtained as colonies formed on the plate medium. In this step, it is possible to select the transformant having the recombinant DNA containing the genome DNA as white colonies by adding X-gal and IPTG (isopropyl-β-thiogalactopyranoside) to the plate medium.

[0045] The transformant is allowed to stand for culturing in a 96-well titer plate to which 0.05 ml of the LB medium containing 0.1 mg/ml of ampicillin has been added in each well. The resulting culture can be used in an experiment of (4) described below. Also, the culture solution can be stored at -80°C by adding 0.05 ml per well of the LB medium containing 20% glycerol to the culture solution, followed by mixing, and the stored culture solution can be used at any time.



(3) Production of cosmid library 



[0046] The genome DNA (0.1 mg) of the coryneform bacteria prepared in the above (1) is partially digested with a restriction enzyme, such as Sau3AI or the like, and then ultracentrifuged (26,000 rpm, 18 hours, 20°C) under a 10 to 40% sucrose density gradient using a 10% sucrose buffer (1 mol/l NaCl, 20 mmol/l Tris hydrochloride, 5 mmol/l EDTA, 10% sucrose, pH 8.0) and a 40% sucrose buffer (elevating the concentration of the 10% sucrose buffer to 40%).

[0047] After the centrifugation, the thus separated solution is fractionated into tubes in 1 ml per each tube. After confirming the DNA fragment size of each fraction by agarose gel electrophoresis, a fraction rich in DNA fragments of about 40 kb is precipitated with ethanol.

[0048] The resulting DNA fragment is ligated to a cosmid vector having a cohesive end which can be ligated to the fragment. When the genome DNA is partially digested with Sau3AI, the partially digested product can be ligated to, for example, the BamHI site of superCos1 (manufactured by Stratagene) in accordance with the manufacturer's instructions.

[0049] The resulting ligation product is packaged using a packaging extract which can be prepared by a method described in Molecular Cloning, 2nd ed. and then used in transforming Escherichia coli. More specifically, the ligation product is packaged using, for example, a commercially available packaging extract, Gigapack III Gold Packaging Extract (manufactured by Stratagene), in accordance with the manufacturer's instructions and then introduced into Escherichia coli XL-1-BlueMR (manufactured by Stratagene) or the like.

[0050] The thus transformed Escherichia coli is spread on an LB plate medium containing ampicillin, and cultured therein.

[0051] The transformant can be obtained as colonies formed on the plate medium. 

[0052] The transformant is subjected to standing culture in a 96-well titer plate to which 0.05 ml of the LB medium containing 0.1 mg/ml ampicillin has been added.

[0053] The resulting culture can be employed in an experiment of (4) described below. Also, the culture solution can be stored at -80°C by adding 0.05 ml per well of the LB medium containing 20% glycerol to the culture solution, followed by mixing, and the stored culture solution can be used at any time.

(4) Determination of nucleotide sequence 



(4-1) Preparation of template 



[0054] The full nucleotide sequence of genome DNA of coryneform bacteria can be determined basically according to the whole genome shotgun method (Science, 269: 496-512 (1995)).

[0055] The template used in the whole genome shotgun method can be prepared by PCR using the library prepared in the above (2) (DNA Research, 5: 1-9 (1998)).

[0056] Specifically, the template can be prepared as follows. 

[0057] The clone derived from the whole genome shotgun library is inoculated by using a replicator (manufactured by GENETIX) into each well of a 96-well plate to which 0.08 ml per well of the LB medium containing 0.1 mg/ml ampicillin has been added, followed by stationarily culturing at 37°C overnight.

[0058] Next, the culture solution is transported, using a copy plate (manufactured by Tokken), into each well of a 96-well reaction plate (manufactured by PE Biosystems) to which 0.025 ml per well of a PCR reaction solution has been added using TaKaRa Ex Taq (manufactured by Takara Shuzo). Then, PCR is carried out in accordance with the protocol by Makino et al. (DNA Research, 5: 1-9 (1998)) using GeneAmp PCR System 9700 (manufactured by PE Biosystems) to amplify the inserted fragments.

[0059] The excessive primers and nucleotides are eliminated using a kit for purifying a PCR product, and the product is used as the template in the sequencing reaction.

[0060] It is also possible to determine the nucleotide sequence using a double-stranded DNA plasmid as a template.

[0061] The double-stranded DNA plasmid used as the template can be obtained by the following method.

[0062] The clone derived from the whole genome shotgun library is inoculated into each well of a 24- or 96-well plate to which 1.5 ml per well of a 2 x YT medium (16 g/l bactotrypton, 10 g/l yeast extract, 5 g/l sodium chloride, pH 7.0) containing 0.05 mg/ml ampicillin has been added, followed by culturing under shaking at 37°C overnight.

[0063] The double-stranded DNA plasmid can be prepared from the culture solution using an [...] preparing machine KURABO PI-50 (manufactured by Kurabo Industries), a multiscreen (manufactured by Millipore) or the like, according to each protocol.

[0064] To purify the plasmid, Biomek 2000 manufactured by Beckman Coulter and the like can be used.

[0065] The resulting purified double-stranded DNA plasmid is dissolved in water to give a concentration of about 0.1 mg/ml. Then, it can be used as the template in sequencing.

(4-2) Sequencing reaction 

[0066] The sequencing reaction can be carried out according to a commercially available sequence kit or the like. A specific method is exemplified below.

[0067] To 6 µl of a solution of ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured by PE Biosystems), 1 to 2 pmol of an M13 regular direction primer (M13-21) or an M13 reverse direction primer (M13REV) (DNA Research, 5: 1-9 (1998)) and 50 to 200 ng of the template prepared in the above (4-1) (the PCR product or plasmid) are added to give 10 µl of a sequencing reaction solution.

[0068] A dye terminator sequencing reaction (35 to 55 cycles) is carried out using this reaction solution and GeneAmp PCR System 9700 (manufactured by PE Biosystems) or the like. The cycle parameter can be determined in accordance with a commercially available kit, for example, the manufacturer's instructions attached to ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit.

[0069] The sample can be purified using a commercially available product, such as Multi Screen HV plate (manufactured by Millipore) or the like, according to the manufacturer's instructions.

[0070] The thus purified reaction product is precipitated with ethanol, dried and then used for the analysis. The dried reaction product can be stored in the dark at -30°C and the stored reaction product can be used at any time.

[0071] The dried reaction product can be analyzed using a commercially available sequencer and an analyzer according to [...]

[...] (manufactured by PE Biosystems). Examples of the analyzer include ABI PRISM 3700 DNA Analyzer (manufactured by PE Biosystems).
(5) Assembly

[0073] A software, such as phred (The University of Washington) or the like, can be used [...] analyzing the sequence information obtained in the above (4). A software, such as Cross_Match (The University of Washington) or SPS Cross_Match (manufactured by Southwest Parallel Software) or the like, can be used to mask [...]

[0074] For the assembly, a software, such as phrap (The University of Washington), SPS phrap (manufactured by Southwest Parallel Software) or the like, can be used.

[0075] In the above analysis and output of the results thereof, a computer such as UNIX, PC, Macintosh, and the like can be used.

[0076] Contigs obtained by the assembly can be analyzed using a graphical editor such as consed (The University of Washington) [...]

[0077] It is also possible to perform a series of the operations from the base call to the assembly in a lump using a script phredPhrap attached to the consed.
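
As a purely illustrative aid, the idea of joining overlapping shotgun reads into contigs can be sketched in Python as follows. This is a toy greedy overlap merger under simplifying assumptions (error-free reads, one strand only, no quality values, no read fully contained in another); it is not the algorithm used by phred, phrap or consed, and the names `overlap` and `greedy_assemble` are hypothetical.

```python
def overlap(a: str, b: str, min_len: int) -> int:
    """Length of the longest suffix of a equal to a prefix of b (at least min_len), else 0."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(reads, min_len=3):
    """Repeatedly merge the pair of sequences with the largest suffix/prefix overlap."""
    contigs = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    while True:
        best = (0, -1, -1)  # (overlap length, index of left piece, index of right piece)
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best[0]:
                        best = (n, i, j)
        n, i, j = best
        if n == 0:
            return contigs  # nothing left to merge; remaining pieces are separate contigs
        merged = contigs[i] + contigs[j][n:]
        contigs = [c for k, c in enumerate(contigs) if k not in (i, j)] + [merged]


if __name__ == "__main__":
    print(greedy_assemble(["ATGCGT", "CGTACC", "ACCTTG"]))
```

Pieces that never overlap remain as separate contigs, which corresponds to the gap parts treated in (6).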

[0078] As used herein, software will be understood to also be referred to as a comparator. 
(6) Determination of nucleotide sequence in gap part

[0079] Each of the cosmids in the cosmid library constructed in the above (3) is prepared in the same manner as in the preparation of the double-stranded DNA plasmid described in the above (4-1). The [...] of the insert fragment of the cosmid is determined using a commercially available kit, such as ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured by PE Biosystems) according to the manufacturer's instructions.

[0080] About 800 cosmid clones are sequenced at both ends of the inserted fragment to detect a nucleotide sequence in the contig derived from the shotgun sequencing obtained in (5) which is coincident with the sequence. Thus, the chain linkage between respective cosmid clones and respective contigs is clarified, and mutual alignment is carried out. Furthermore, the results are compared with known physical maps to map the cosmids and the contigs. In case of Corynebacterium glutamicum ATCC 13032, a physical map of Mol. Gen. Genet., 252: 255-265 (1996) can be used.

[0081] The sequence in the region which cannot be covered with the contigs (gap part) can be determined by the following method.

[0082] Clones containing sequences positioned at the ends of the contigs are selected. Among these, a clone wherein only one end of the inserted fragment has been determined is selected and the sequence at the opposite end of the inserted fragment is determined.

[0083] A shotgun library clone or a cosmid clone derived therefrom containing the sequences at the respective ends of the inserted fragments in the two contigs is identified and the full nucleotide sequence of the inserted fragment of the clone is determined.

[0084] According to this method, the nucleotide sequence of the gap part can be determined. 
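
The use of clone end sequences to find which contigs flank a gap can be sketched as follows. This is a minimal sketch, assuming that each clone end has already been matched to at most one contig by some sequence search; the function name `link_contigs`, the clone names and the contig names are hypothetical and do not come from the text.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Tuple


def link_contigs(
    end_hits: Dict[str, Tuple[Optional[str], Optional[str]]]
) -> Dict[Tuple[str, str], List[str]]:
    """Group clones by the pair of distinct contigs hit by their two insert ends.

    end_hits maps a clone name to (contig hit by one end, contig hit by the other end);
    None means that the end is not sequenced yet or matches no contig.
    """
    links = defaultdict(list)
    for clone, (left, right) in end_hits.items():
        if left and right and left != right:
            # The clone bridges two contigs, so its insert covers the gap between them.
            links[tuple(sorted((left, right)))].append(clone)
    return dict(links)


if __name__ == "__main__":
    hits = {
        "cos1": ("contigA", "contigB"),
        "cos2": ("contigB", "contigA"),
        "cos3": ("contigA", "contigA"),  # both ends inside one contig: no gap spanned
        "cos4": ("contigC", None),       # opposite end still to be determined
    }
    print(link_contigs(hits))
```

Clones such as the hypothetical `cos4`, with only one end placed, are the candidates whose opposite end would be sequenced next.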

[0085] When no shotgun library clone or cosmid clone covering the gap part is available, primers complementary to the end sequences of the two different contigs are prepared and the DNA fragment in the gap part is amplified. Then, sequencing is performed by the primer walking method using the amplified DNA fragment as a template or by the shotgun method in which the sequence of a shotgun clone prepared from the amplified DNA fragment is determined. Thus, the nucleotide sequence of the above-described region can be determined.

[0086] In a region showing a low sequence accuracy, primers are synthesized using AUTOFINISH function and NAVIGATING function of consed (The University of Washington), and the sequence is determined by the primer walking method to improve the sequence accuracy.

[0087] Examples of the thus determined nucleotide sequence of the full genome include the full nucleotide sequence of genome of Corynebacterium glutamicum ATCC 13032 represented by SEQ ID NO:1.

(7) Determination of nucleotide sequence of microorganism genome DNA using the nucleotide sequence represented by SEQ ID NO:1

[0088] A nucleotide sequence of a polynucleotide having a homology of 80% or more with the full nucleotide sequence of Corynebacterium glutamicum ATCC 13032 represented by SEQ ID NO:1 as determined above can also be determined using the nucleotide sequence represented by SEQ ID NO:1, and the polynucleotide having a nucleotide sequence having a homology of 80% or more with the nucleotide sequence represented by SEQ ID NO:1 of the present invention is within the scope of the present invention. The term "polynucleotide having a nucleotide sequence having a homology of 80% or more with the nucleotide sequence represented by SEQ ID NO:1 of the present invention" is a polynucleotide in which a full nucleotide sequence of the chromosome DNA can be determined using as a primer an oligonucleotide composed of continuous 5 to 50 nucleotides in the nucleotide sequence represented by SEQ ID NO:1, for example, according to PCR using the chromosome DNA as a template. A particularly preferred primer in determination of the full nucleotide sequence is an oligonucleotide having nucleotide sequences which are positioned at the interval of about 300 to 500 bp, and among such oligonucleotides, an oligonucleotide having a nucleotide sequence selected from DNAs encoding a protein relating to a main metabolic pathway is particularly preferred. The polynucleotide in which the full nucleotide sequence of the chromosome DNA can be determined using the oligonucleotide includes polynucleotides constituting a chromosome DNA derived from a microorganism belonging to coryneform bacteria. Such a polynucleotide is preferably a polynucleotide constituting chromosome DNA derived from a microorganism belonging to the genus Corynebacterium, more preferably a polynucleotide constituting a chromosome DNA of Corynebacterium glutamicum.



2. Identification of ORF (open reading frame) and expression regulatory fragment and determination of the function of ORF



[0089] Based on the full nucleotide sequence data of the genome derived from coryneform bacteria determined in the above item 1, an ORF and an expression modulating fragment can be identified. Furthermore, the function of the thus determined ORF can be determined.

[0090] The ORF means a continuous region in the nucleotide sequence of mRNA which can be translated as an amino acid sequence to mature to a protein. A region of the DNA coding for the ORF of mRNA is also called ORF.
[0091] The expression modulating fragment (hereinafter referred to as "EMF") is used herein to define a series of polynucleotide fragments which modulate the expression of the ORF or another sequence operably ligated thereto. The expression "modulate the expression of a sequence operably ligated" is used herein to refer to changes in the expression of a sequence due to the presence of the EMF. Examples of the EMF include a promoter, an operator, an enhancer, a silencer, a ribosome-binding sequence, a transcriptional termination sequence, and the like. In coryneform bacteria, an EMF is usually present in an intergenic segment (a fragment positioned between two genes; [...] 200 nucleotides in length). Accordingly, an EMF is frequently present in an intergenic segment of 10 nucleotides or longer. It is also possible to determine or discover the presence of an EMF by using known EMF sequences as a target sequence or a target structural motif (or a target motif) using an appropriate software or comparator, such as FASTA (Proc. Natl. Acad. Sci. USA, 85: 2444-48 (1988)), BLAST (J. Mol. Biol., 215: 403-410 (1990)) or the like. Also, it can be identified and evaluated using a known EMF-capturing vector (for example, pKK232-8; manufactured by Amersham [...]).
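
The observation that an EMF is frequently found in an intergenic segment of 10 nucleotides or longer suggests a simple first filter, sketched below. This is a minimal sketch, assuming that gene positions are available as 0-based half-open (start, end) coordinates on one strand; the function name `intergenic_segments` and the example coordinates are hypothetical.

```python
def intergenic_segments(genes, min_len=10):
    """Return (start, end) gaps between consecutive genes that are at least min_len long.

    genes: iterable of (start, end) pairs, 0-based and half-open.
    """
    segments = []
    prev_end = None
    for start, end in sorted(genes):
        if prev_end is not None and start - prev_end >= min_len:
            segments.append((prev_end, start))
        # Track the furthest gene end seen so far so that overlapping genes are handled.
        prev_end = end if prev_end is None else max(prev_end, end)
    return segments


if __name__ == "__main__":
    # Gap of 5 nt (too short) between the first two genes, gap of 150 nt before the third.
    print(intergenic_segments([(0, 100), (105, 300), (450, 600)]))
```

Segments returned by such a filter are only candidates; the text leaves the actual confirmation of an EMF to sequence comparison or to an EMF-capturing vector.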

[0092] The term "target sequence" is used herein to refer to a nucleotide sequence composed of 6 or more nucleotides, an amino acid sequence composed of 2 or more amino acids, or a nucleotide sequence encoding an amino acid sequence composed of 2 or more amino acids. A longer target sequence appears [...] at a lower possibility. The target sequence is preferably about 10 to 100 amino acid residues or about 30 to 300 nucleotide residues.

[0093] The term "target structural motif" or "target motif" is used herein to refer to a sequence or a combination of sequences selected optionally and reasonably. Such a motif is selected on the basis of the three-dimensional structure formed by the folding of a polypeptide by means known to one of ordinary skill in the art. Various motifs are known.

[0094] Examples of the target motif of a polypeptide include, but are not limited to, an enzyme active site, a protein-protein interaction site, a signal sequence, and the like. Examples of the target motif of a nucleic acid include a promoter sequence, a transcriptional regulatory factor binding sequence, a hairpin structure, and the like.

[0095] Examples of highly useful EMF include a high-expression promoter, an inducible-expression promoter, and the like. Such an EMF can be obtained by positionally determining the nucleotide sequence of a gene which is known or expected as achieving high expression (for example, ribosomal RNA gene; GenBank Accession No. [...] or Z46753) or a gene showing a desired induction pattern (for example, isocitrate lyase gene induced by acetic acid; Japanese Published Unexamined Patent Application No. 56782/93) via the alignment with the full genome nucleotide sequence determined in the above item 1, and isolating the genome fragment in the upstream part ([...] 200 to 500 nucleotides from the translation initiation site). It is also possible to obtain a highly useful EMF by selecting an EMF showing a high expression efficiency or a desired induction pattern from among promoters captured by the EMF-capturing vector as described above.

[0096] The ORF can be identified by extracting characteristics common to individual ORFs, constructing a general model based on these characteristics, and measuring the conformity of the subject sequence with the model. In the identification, a software, such as GeneMark (Nuc. Acids. Res., 22: 4756-67 (1994); [...]), GeneMark.hmm (manufactured by GenePro), GeneHacker (Protein, Nucleic Acid and Enzyme, [...] (1997)), Glimmer (Nuc. Acids. Res., 26: 544-548 (1998); manufactured by The Institute of Genomic Research), or the like, can be used. In using the software, the default (initial setting) parameters are usually used, though the parameters can be [...]

[0097] In the above-described comparisons, a computer, such as UNIX, PC, Macintosh, or the like, can be used.
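
To make the notion of an ORF concrete, a naive scan for start-to-stop stretches can be sketched as follows. This is only a toy under stated assumptions (ATG as the sole start codon, the three standard stop codons, the given strand only); it does not build the statistical model described above and is not how GeneMark, GeneHacker or Glimmer work. The name `find_orfs` and the `min_codons` parameter are hypothetical.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(seq, min_codons=3):
    """Return sorted (start, end) 0-based half-open coordinates of ATG...stop stretches.

    Only the given strand is scanned, in each of its three reading frames.
    min_codons counts the codons before the stop codon, including the ATG.
    """
    seq = seq.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i  # first in-frame start codon opens a candidate ORF
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))  # include the stop codon
                start = None
    return sorted(orfs)


if __name__ == "__main__":
    print(find_orfs("CCATGAAACCCGGGTAACC"))
```

A real gene finder would also scan the complementary strand, accept alternative start codons and score candidates against a model, as the cited programs do.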
[0098] Examples of the ORF determined by the method of the present invention include ORFs having the nucleotide sequences represented by SEQ ID NOS:2 to 3501 present in the genome of Corynebacterium glutamicum as represented by SEQ ID NO:1. In these ORFs, polypeptides having the amino acid sequences represented by SEQ ID NOS:3502 to 7001 are encoded.

[0099] The function of an ORF can be determined by comparing the identified amino acid sequence of the ORF with known homologous sequences using a homology searching software or comparator, such as BLAST, FASTA, Smith & Waterman (Meth. Enzym., 164: 765 (1988)) or the like on an amino acid data base, such as Swiss-Prot, PIR, GenBank-nr-aa, GenPept constituted by protein-encoding domains derived from GenBank data base, OWL or the like.

[0100] Furthermore, by the homology searching, the identity and similarity with the amino acid sequences of known [...]

[0101] With respect to the term "identity" used herein, where two polypeptides each having 10 amino acids are different in the positions of 3 amino acids, these polypeptides have an identity of 70% with each other. In case where one of the different 3 amino acids is analogue (for example, leucine and isoleucine), these polypeptides have a similarity of 80%.
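
The arithmetic of this example can be checked with a short Python sketch. It assumes two sequences that are already aligned to equal length without gaps, and it uses a small, purely illustrative grouping of analogous amino acids (only the leucine/isoleucine pair is taken from the text); the names `SIMILAR_GROUPS` and `identity_and_similarity` are hypothetical. Under these assumptions, the two 10-residue peptides below give 70% identity and 80% similarity.

```python
# Illustrative groups of amino acids counted as analogous (an assumption, not from the text).
SIMILAR_GROUPS = [set("LIVM"), set("DE"), set("KR"), set("ST"), set("FYW"), set("NQ")]


def identity_and_similarity(a, b):
    """Return (identity %, similarity %) for two pre-aligned, equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and aligned to equal length")
    identical = sum(x == y for x, y in zip(a, b))
    similar = sum(
        x == y or any(x in group and y in group for group in SIMILAR_GROUPS)
        for x, y in zip(a, b)
    )
    return 100 * identical / len(a), 100 * similar / len(a)


if __name__ == "__main__":
    # Three positions differ; one of them (L versus I) is an analogous pair.
    print(identity_and_similarity("MKLAVGTSDE", "MKIAVGTWPE"))
```

Homology search programs such as BLAST or FASTA use substitution matrices rather than fixed groups, so their reported similarity values depend on the matrix chosen.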

[0102] As a specific example, Table 1 shows the registration numbers in known data bases of sequences which are judged as having the highest similarity with the nucleotide sequence of the ORF derived from Corynebacterium glutamicum ATCC 13032, genes of these sequences, functions of these genes, and identities thereof compared with known amino acid translation sequences.

[0103] Thus, a great number of novel genes derived from coryneform bacteria can be identified by determining the full nucleotide sequence of the genome derived from coryneform bacterium by the means of the present invention. Moreover, the function of the proteins encoded by these genes can be determined. Since coryneform bacteria are industrially highly useful microorganisms, many of the identified genes are industrially useful.

[0104] Moreover, the characteristics of respective microorganisms can be clarified by classifying the functions thus determined. As a result, valuable information in breeding is obtained.

[0105] Furthermore, from the ORF information derived from coryneform bacteria, the ORF corresponding to the microorganism is prepared and obtained according to the general method as disclosed in Molecular Cloning, 2nd ed. or the like. Specifically, an oligonucleotide having a nucleotide sequence adjacent to the ORF is synthesized, and the ORF can be isolated and obtained using the oligonucleotide as a primer and a chromosome DNA derived from coryneform bacteria as a template according to the general PCR cloning technique. Thus obtained ORF sequences include polynucleotides comprising the nucleotide sequence represented by any one of SEQ ID NOS:2 to 3501.

[0106] The ORF or primer can be prepared using a polypeptide synthesizer based on the above sequence information.

[0107] Examples of the polynucleotide of the present invention include a polynucleotide containing the nucleotide sequence of the ORF obtained in the above, and a polynucleotide which hybridizes with the polynucleotide under stringent conditions.

[0108] The polynucleotide of the present invention can be a single-stranded DNA, a double-stranded DNA and a single-stranded RNA, though it is not limited thereto.

[0109] The polynucleotide which hybridizes with the polynucleotide containing the nucleotide sequence of the ORF obtained in the above under stringent conditions includes a degenerated mutant of the ORF. A degenerated mutant is a polynucleotide fragment having a nucleotide sequence which is different from the sequence of the ORF of the present invention but which encodes the same amino acid sequence owing to the degeneracy of the genetic code.
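
Whether one sequence is a degenerated mutant of another in this sense can be tested by translating both, as sketched below. This is a minimal sketch, assuming in-frame DNA sequences and the standard codon assignments; the names `translate` and `is_degenerate_variant` and the example sequences are hypothetical.

```python
from itertools import product

# Standard codon table built in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna):
    """Translate an in-frame DNA sequence; trailing bases that do not fill a codon are ignored."""
    dna = dna.upper()
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def is_degenerate_variant(orf_a, orf_b):
    """True if the two sequences differ as nucleotides but encode the same amino acids."""
    return orf_a.upper() != orf_b.upper() and translate(orf_a) == translate(orf_b)


if __name__ == "__main__":
    print(translate("ATGGCTAAA"))
    print(is_degenerate_variant("ATGGCTAAA", "ATGGCCAAG"))
    print(is_degenerate_variant("ATGGCTAAA", "ATGGTTAAA"))
```

The second comparison returns False because GTT encodes valine rather than alanine, so that change is a replacement mutation and not a degenerate one.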

[0110] Specific examples include a polynucleotide comprising the nucleotide sequence represented by any one of SEQ ID NOS:2 to 3431, and a polynucleotide which hybridizes with the polynucleotide under stringent conditions.

[0111] A polynucleotide which hybridizes under stringent conditions is a polynucleotide obtained by colony hybridization, plaque hybridization, Southern blot hybridization or the like using, as a probe, the polynucleotide having the nucleotide sequence of the ORF identified in the above. Specific examples include a polynucleotide which can be identified by carrying out hybridization at 65°C in the presence of 0.7-1.0 M NaCl using a filter on which a polynucleotide prepared from colonies or plaques is immobilized, and then washing the filter with 0.1x to 2x SSC solution (the composition of 1x SSC contains 150 mM sodium chloride and 15 mM sodium citrate) at 65°C.

[0112] The hybridization can be carried out in accordance with known methods described in, for example, Molecular Cloning, 2nd ed., Current Protocols in Molecular Biology, DNA Cloning 1: Core Techniques, A Practical Approach, Second Edition, Oxford University (1995) or the like. Specific examples of the polynucleotide which can be hybridized include a DNA having a homology of 60% or more, preferably 80% or more, and particularly preferably 95% or more, with the nucleotide sequence represented by any one of SEQ ID NOS:2 to 3431 when calculated using default (initial setting) parameters of a homology searching software, such as BLAST, FASTA, Smith-Waterman or the like.
[0113] Also, the polynucleotide of the present invention includes a polynucleotide encoding a polypeptide comprising the amino acid sequence represented by any one of SEQ ID NOS:3502 to 6931 and a polynucleotide which hybridizes with the polynucleotide under stringent conditions.

[0114] Furthermore, the polynucleotide of the present invention includes a polynucleotide which is present in the 5' upstream or 3' downstream region of a polynucleotide comprising the nucleotide sequence of any one of SEQ ID NOS:2 to 3431 in a polynucleotide comprising the nucleotide sequence represented by SEQ ID NO:1, and has an activity of regulating an expression of a polypeptide encoded by the polynucleotide. Specific examples of the polynucleotide having an activity of regulating an expression of a polypeptide encoded by the polynucleotide include a polynucleotide encoding the above described EMF, such as a promoter, an operator, an enhancer, a silencer, a ribosome-binding sequence, a transcriptional termination sequence, and the like.

[0115] The primer used for obtaining the ORF according to the above PCR cloning technique includes an oligonucleotide comprising a sequence which is the same as a sequence of 10 to 200 continuous nucleotides in the nucleotide sequence of the ORF and an adjacent region or an oligonucleotide comprising a sequence which is complementary to the oligonucleotide. Specific examples include an oligonucleotide comprising a sequence which is the same as a sequence of 10 to 200 continuous nucleotides of the nucleotide sequence represented by any one of SEQ ID NOS:1 to 3431, and an oligonucleotide comprising a sequence complementary to the oligonucleotide comprising a sequence of at least 10 to 20 continuous nucleotides of any one of SEQ ID NOS:1 to 3431. When the primers are used as a sense primer and an antisense primer, the above-described oligonucleotides in which melting temperature (Tm) and the number of nucleotides are not significantly different from each other are preferred.
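
The preference for a sense primer and an antisense primer of similar melting temperature and length can be illustrated with a rough check. The text does not say how Tm is estimated; the sketch below assumes the Wallace rule of thumb for short oligonucleotides (2°C per A or T, 4°C per G or C), and the tolerance values `max_tm_diff` and `max_len_diff` as well as the function names are hypothetical.

```python
def wallace_tm(primer):
    """Rough Tm estimate in degrees C for a short oligonucleotide: 2*(A+T) + 4*(G+C)."""
    p = primer.upper()
    at = p.count("A") + p.count("T")
    gc = p.count("G") + p.count("C")
    return 2 * at + 4 * gc


def well_matched(sense, antisense, max_tm_diff=4, max_len_diff=3):
    """True if the two primers differ little in estimated Tm and in length.

    The tolerances are illustrative assumptions, not values given in the text.
    """
    tm_ok = abs(wallace_tm(sense) - wallace_tm(antisense)) <= max_tm_diff
    len_ok = abs(len(sense) - len(antisense)) <= max_len_diff
    return tm_ok and len_ok


if __name__ == "__main__":
    print(wallace_tm("ATGCATGCATGCATGCAT"))
    print(well_matched("ATGCATGCATGCATGCAT", "TACGTACGTACGTACGTA"))
    print(well_matched("ATGCATGCATGCATGCAT", "GGCCGGCCGGCCGGCCGG"))
```

For longer primers or for precise work, a nearest-neighbour thermodynamic estimate would normally replace this rule of thumb.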

[0116] The oligonucleotide of the present invention includes an oligonucleotide comprising a sequence which is the same as 10 to 200 continuous nucleotides of the nucleotide sequence represented by any one of SEQ ID NOS:1 to 3431 or an oligonucleotide comprising a sequence complementary to the oligonucleotide.

[0117] Also, analogues of these oligonucleotides (hereinafter also referred to as "analogous oligonucleotides") are also provided by the present invention and are useful in the methods described herein.

[0118] Examples of the analogous oligonucleotides include analogous oligonucleotides in which a phosphodiester [...]

[...] great frequency, in a random manner. [...] provided by the present invention, it is possible [...] substances.

4. Clarification or determination of biosynthesis pathway and signal transmission pathway

[...] bacteria since a number of genes have not been identified so far.

[0127] These unknown points can be clarified by the following method.

[0128] The functional information of ORF [...] by the method of above item 2 is arranged. The term "arranged" means that the ORF [...] information according to the functional [...]

[...] a mutant wherein the utilization frequency of this pathway is lowered.
5. Clarification or determination of useful mutation point

[0131] Many useful mutants of coryneform bacteria which are suitable for the production of useful substances, such as amino acids, nucleic acids, vitamins, saccharides, organic acids, and the like, have been obtained. However, it is hardly known which mutation point is imparted to a gene to improve the productivity.

[0132] However, mutation points contained in production strains can be identified by comparing desired sequences of the genome DNA of the production strains obtained from coryneform bacteria by the mutagenic technique with the nucleotide sequences of the corresponding genome DNA and ORF derived from coryneform bacteria determined by the methods of the above items 1 and 2 and analyzing them.
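
The comparison of a production-strain sequence with the corresponding reference sequence can be sketched as a position-by-position scan. This is a minimal sketch, assuming two sequences of equal length that are already aligned without insertions or deletions; the function name `mutation_points` and the example sequences are hypothetical.

```python
def mutation_points(reference, production):
    """Return (1-based position, reference base, production base) for every mismatch."""
    if len(reference) != len(production):
        raise ValueError("sequences must be aligned to equal length")
    return [
        (i + 1, r, p)
        for i, (r, p) in enumerate(zip(reference.upper(), production.upper()))
        if r != p
    ]


if __name__ == "__main__":
    # A single base substitution at position 5 (T in the reference, C in the production strain).
    print(mutation_points("ATGGTTAAA", "ATGGCTAAA"))
```

Real strain comparisons would first align the sequences with a tool that tolerates insertions and deletions; the scan above covers only the substitution case.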

[0133] Moreover, effective mutation points contributing to the production can be easily specified from among these mutation points on the basis of known information relating to the metabolic pathways, the metabolic regulatory mechanisms, the structure activity correlation of enzymes, and the like.

[0134] When any efficient mutation can be hardly specified based on known data, the mutation points thus identified can be introduced into a wild strain of coryneform bacteria or a production strain free of the mutation. Then, it is examined whether or not any positive effect can be achieved on the production.

[0135] For example, by comparing the nucleotide sequence of homoserine dehydrogenase gene hom of a lysine-producing B-6 strain of Corynebacterium glutamicum (Appl. Microbiol. Biotechnol., 32: 269-273 (1989)) with the nucleotide sequence corresponding to the genome of Corynebacterium glutamicum ATCC 13032 according to the present invention, a mutation of amino acid replacement in which valine at the 59-position is replaced with alanine (Val59Ala) was identified. A strain obtained by introducing this mutation into the ATCC 13032 strain by the gene replacement method can produce lysine, which indicates that this mutation is an effective mutation contributing to the production of lysine.

[0136] Similarly, by comparing the nucleotide sequence of pyruvate carboxylase gene pyc of the B-6 strain with the nucleotide sequence corresponding to the ATCC 13032 genome, a mutation of amino acid replacement in which proline at the 458-position was replaced with serine (Pro458Ser) was identified. A strain obtained by introducing this mutation into a lysine-producing strain of No. 58 (FERM BP-7134) of Corynebacterium glutamicum free of this mutation shows an improved lysine productivity in comparison with the No. 58 strain, which indicates that this mutation is an effective mutation contributing to the production of lysine.

[0137] In addition, a mutation Ala213Thr in glucose-6-phosphate dehydrogenase was specified as an effective
mutation relating to the production of lysine by detecting glucose-6-phosphate dehydrogenase gene zwf of the B-6 strain.
[0138] Furthermore, the lysine productivity of Corynebacterium glutamicum was improved by replacing the base at
the 932-position of aspartokinase gene lysC of the Corynebacterium glutamicum ATCC 13032 genome with cytosine
to thereby replace threonine at the 311-position by isoleucine, which indicates that this mutation is an effective mutation
contributing to the production of lysine.
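
The relationship between a base position and the affected amino acid position can be checked with simple arithmetic. The sketch below is illustrative only and assumes that bases are numbered from 1 starting at the first base of the start codon; the function name is hypothetical.

```python
def codon_of_base(base_position):
    """Map a 1-based base position in a coding sequence to
    (1-based codon number, 1-based position within that codon)."""
    if base_position < 1:
        raise ValueError("base positions are numbered from 1")
    index = base_position - 1
    return index // 3 + 1, index % 3 + 1
```

Under this numbering, the base at the 932-position is the second base of the codon for the amino acid at the 311-position, which agrees with the positions given above.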

[0139] Also, as another method to examine whether or not the identified mutation point is an effective mutation, there
is a method in which the mutation possessed by the lysine-producing strain is returned to the sequence of a wild type
strain by the gene replacement method, and it is examined whether or not this has a negative influence on the lysine
productivity. For example, when the amino acid replacement mutation Val59Ala possessed by hom of the lysine-producing
B-6 strain was returned to a wild type amino acid sequence, the lysine productivity was lowered in comparison with the
B-6 strain. Thus, it was found that this mutation is an effective mutation contributing to the production of lysine.
[0140] Effective mutation points can be more efficiently and comprehensively extracted by combining, if needed, the
DNA array analysis or proteome analysis described below.

6. Method of breeding industrially advantageous production strain 

[0141] It has been a general practice to construct production strains, which are used industrially in the fermentation
production of the target useful substances, such as amino acids, nucleic acids, vitamins, saccharides, organic acids,
and the like, by repeating mutagenesis and breeding based on random mutagenesis using mutagens, such as NTG
or the like, and screening.

[0142] In recent years, many examples of improved production strains have been made through the use of
recombinant DNA techniques. In breeding, however, most of the parent production strains to be improved are mutants
obtained by a conventional mutagenic procedure (W. Leuchtenberger, Amino Acids - Technical Production and Use. In:
Roehr (ed) Biotechnology, second edition, vol. 6, products of primary metabolism. VCH Verlagsgesellschaft mbH,
Weinheim, P465 (1996)).

[0143] Although mutagenesis methods have largely contributed to the progress of the fermentation industry, they
suffer from a serious problem of multiple, random introduction of mutations into every part of the chromosome. Since
many mutations are accumulated in a single chromosome each time a strain is improved, a production strain obtained
by the random mutation and selecting is generally inferior in properties (for example, showing poor growth, delayed
consumption of saccharides, and poor resistance to stresses such as temperature and oxygen) to a wild type strain,
which brings about troubles such as failing to establish a sufficiently elevated productivity, being frequently
contaminated with miscellaneous bacteria, requiring troublesome procedures in culture maintenance, and the like, and, in its

and the access device utilized. Also, various data processing programs, software, comparator and formats are used
for recording and utilizing the polynucleotide sequence information or the like of the present invention in the recording
medium. The information can be expressed in the form of a binary file, a text file or an ASCII file formatted with
commercially available software, for example. Moreover, software for accessing the sequence information is available and
is known to one of ordinary skill in the art.

[0183] Examples of the information to be recorded in the above-described medium include the full genome nucleotide
sequence information of coryneform bacteria as obtained in the above item 2, the nucleotide sequence information of
ORF, the amino acid sequence information encoded by the ORF, and the functional information of polynucleotides
coding for the amino acid sequences.

[0184] The recording medium or storage device which is readable by a computer according to the present invention
refers to a medium in which the information of the present invention has been recorded. Examples include recording
media or storage devices which are readable by a computer storing the nucleotide sequence information represented
by SEQ ID NOS:1 to 3501, the amino acid sequence information represented by SEQ ID NOS:3502 to 7001, the
functional information of the nucleotide sequences represented by SEQ ID NOS:1 to 3501, the functional information
of the amino acid sequences represented by SEQ ID NOS:3502 to 7001, and the information listed in Table 1 below
and the like.
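
As a purely illustrative sketch of recording such information in a text file, the example below serializes records to JSON text and reads them back. The record layout, the field names and the example values are assumptions made for illustration; they do not reproduce any sequence or format of the present invention.

```python
import json


def build_record(seq_id, nucleotide, function):
    """Bundle one sequence entry; the field names are hypothetical."""
    return {"seq_id": seq_id, "nucleotide": nucleotide, "function": function}


def dump_records(records):
    """Serialize a list of records to text that can be written to a file."""
    return json.dumps(records, indent=2, sort_keys=True)


def load_records(text):
    """Parse the text back into a dictionary indexed by sequence identifier."""
    return {record["seq_id"]: record for record in json.loads(text)}
```

A text representation of this kind can be stored on any computer-readable medium and searched by the software described in the following item.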

9. System based on a computer using the recording medium of the present invention which is readable by a computer

[0185] The term "system based on a computer" as used herein refers to a system composed of hardware device(s),
software device(s), and data recording device(s) which are used for analyzing the data recorded in the recording
medium of the present invention which is readable by a computer.

[0186] The hardware device(s) are, for example, composed of an input unit, a data recording unit, a central processing
unit and an output unit collectively or individually.

[0187] By the software device(s), the data recorded in the recording medium of the present invention are searched
or analyzed using the recorded data and the hardware device(s) as described herein. Specifically, the software
device(s) contain at least one program which acts on or with the system in order to screen, analyze or compare biologically
meaningful structures or information from the nucleotide sequences, amino acid sequences and the like recorded in
the recording medium according to the present invention.

[0188] Examples of the software device(s) for identifying ORF and EMF domains include GeneMark (Nuc. Acids.
Res., 22: 4756-67 (1994)), GeneHacker (Protein, Nucleic Acid and Enzyme, 42: 3001-07 (1997)), Glimmer (The
Institute of Genomic Research; Nuc. Acids. Res., 26: 544-548 (1998)) and the like. In the process of using such a software
device, the default (initial setting) parameters are usually used, although the parameters can be changed, if necessary,
in a manner known to one of ordinary skill in the art.
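
The idea of ORF identification can be illustrated by a deliberately naive sketch. It is not the algorithm of GeneMark, GeneHacker or Glimmer, which use statistical models; it only scans the forward strand for a region that begins with ATG and ends at the first in-frame stop codon. The function name and the `min_codons` parameter are hypothetical, and alternative start codons and the complementary strand are ignored for brevity.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def find_orfs(sequence, min_codons=2):
    """Return sorted (start, end) pairs, 0-based and end-exclusive, for
    forward-strand ORFs that run from ATG to the first in-frame stop codon.
    min_codons counts the codons before the stop codon."""
    seq = sequence.upper()
    orfs = []
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if (i - start) // 3 >= min_codons:
                    orfs.append((start, i + 3))
                start = None
    return sorted(orfs)
```

A practical gene finder additionally evaluates coding potential and upstream signals, which is why the dedicated software listed above is normally used.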

[0189] Examples of the software device(s) for identifying a genome domain or a polypeptide domain analogous to
the target sequence or the target structural motif (homology searching) include FASTA, BLAST, Smith-Waterman,
GenetyxMac (manufactured by Software Development), GCG Package (manufactured by Genetic Computer Group),
GenCore (manufactured by Compugen), and the like. In the process of using such a software device, the default (initial
setting) parameters are usually used, although the parameters can be changed, if necessary, in a manner known to
one of ordinary skill in the art.
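
Of the methods named above, the Smith-Waterman local alignment is simple enough to sketch directly. The example below computes only the best local alignment score with a linear gap penalty; the scoring values (match 2, mismatch -1, gap -2) are assumptions chosen for illustration and are not the defaults of any of the listed packages.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b,
    using a linear gap penalty and keeping two rows of the score matrix."""
    previous = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        current = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diagonal = previous[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A cell never drops below zero, which is what makes the alignment local.
            current[j] = max(0, diagonal, previous[j] + gap, current[j - 1] + gap)
            best = max(best, current[j])
        previous = current
    return best
```

Heuristic programs such as FASTA and BLAST approximate this exhaustive calculation to make searches against a full genome faster.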

[0190] Such a recording medium storing the full genome sequence data is useful in preparing a polynucleotide array
by which the expression amount of a gene encoded by the genome DNA of coryneform bacteria and the expression
profile at the total gene level of the microorganism, namely, which genes among many genes encoded by the genome
have been expressed and the expression ratio thereof, can be determined.

[0191] The data recording device(s) provided by the present invention are, for example, memory device(s) for
recording the data recorded in the recording medium of the present invention and target sequence or target structural
motif data, or the like, and memory accessing device(s) for accessing the same.

[0192] Namely, the system based on a computer according to the present invention comprises the following:

(i) a user input device that inputs the information stored in the recording medium of the present invention, and
target sequence or target structure motif information;

(ii) a data storage device for at least temporarily storing the input information;

(iii) a comparator that compares the information stored in the recording medium of the present invention with the
target sequence or target structure motif information, recorded by the data storing device of (ii) for screening and
analyzing nucleotide sequence information which is coincident with or analogous to the target sequence or target
structure motif information; and

(iv) an output device that shows a screening or analyzing result obtained by the comparator.
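
The four components listed above can be pictured with a minimal software sketch. This is an illustration under stated assumptions and not the claimed system: the class and method names are hypothetical, and the comparator finds only sequences that contain the target exactly ("coincident with"), whereas finding analogous sequences would require an alignment method.

```python
from dataclasses import dataclass, field


@dataclass
class SequenceSearchSystem:
    stored: dict                                  # recorded data: name -> sequence
    targets: list = field(default_factory=list)   # (ii) temporary data storage

    def input_target(self, target):
        """(i) Accept a target sequence from the user."""
        self.targets.append(target.upper())

    def compare(self):
        """(iii) Compare each stored target with the recorded sequences."""
        return {
            target: sorted(
                name for name, seq in self.stored.items() if target in seq.upper()
            )
            for target in self.targets
        }

    def output(self):
        """(iv) Render the screening result as text."""
        return "\n".join(
            f"{target}: {', '.join(hits) if hits else 'no hit'}"
            for target, hits in self.compare().items()
        )
```

For example, a system loaded with two hypothetical entries reports which of them contain a given target:

```python
system = SequenceSearchSystem({"orfA": "ATGGCTAAATAA", "orfB": "ATGCCCTGA"})
system.input_target("gcta")
print(system.output())
```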



EP1 108 790 A2 



in the methods in items 2 to 5 as described above for searching and analyzing the ORF

of orthologs and paralogs. 

10. Production of polypeptide using ORF derived from coryneform bacteria
[0194] The polypeptide of the present invention can be produced using a

In the above item 2. Specmca.V. the P^'VP^P^^^ iS MoJuisr aoning. 

cleotide of the present invention or a fragment V^' ^f^^ to the following method. 

2nd ed.. Current Protocols in Molecular Bu>logy. and tl^e l.te. =^1^^^^^ ^ prepared from the full 

[01951 A DNA fragment having a suitable length containing a part encoding the polypepnae is p p 

length ORF sequence, if necessary. .„„.,^„no At a oart encodina the polypeptide of the present 

L.mbm=n. vector oont^nlna the DNA «»j*nj^«^^ „" 
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PET11 (manufactured by Stratagene). pBAD. pThioHte and Pj^f 
;GB(2^(manufacturedbyAme.ham^^^^^^^^ 

turedby BoehringerMannhe|m CO.). PSE2M "^^^^ unexamined Patent Application No. 110600/83). 

pQE-8 (manufactured by QIAGEN). pKYPIO {Japanee^ o, i Vv.^ <w !>77 M989tt dGEL1 (Proc. Naff. AcaH. 
SkYP200 (Agric: Chem.. 4ft 669 (1984)). P^SAI (^ffric^S/oA CJe^^^^ o^i^oTreprredfrom Escherichia coH 
L. USA. «^ 4306 (1985)). pBluescript II SK(-) ('^^^/^^i^*' J^^f J^^f^^S pGHA2 
JM109/pTrS30 (FERM BP^5402). pTj32 ft^^^^^^^ 
mized, in a known manner, depending on the host cells and environmental conditions utilized.
[0207] Examples of the host cell include microorganisms belonging to the genus Escherichia, the genus Serratia,
the genus Bacillus, the genus Brevibacterium, the genus Corynebacterium, the genus Microbacterium, the genus
Pseudomonas, and the like. Specific examples include Escherichia coli XL1-Blue, Escherichia coli XL2-Blue, Escherichia
coli DH1, Escherichia coli MC1000, Escherichia coli KY3278, Escherichia coli W1485, Escherichia coli JM109,
Escherichia coli HB101, Escherichia coli No. 49, Escherichia coli W3110, Escherichia coli NY49, Escherichia coli GI698,
Escherichia coli TB1, Serratia ficaria, Serratia fonticola, Serratia liquefaciens, Serratia marcescens, Bacillus subtilis,
Bacillus amyloliquefaciens, Corynebacterium ammoniagenes, Brevibacterium immariophilum ATCC 14068,
Brevibacterium saccharolyticum ATCC 14066, Corynebacterium glutamicum ATCC 13032, Corynebacterium glutamicum ATCC
13869, Corynebacterium glutamicum ATCC 14067 (prior genus and species: Brevibacterium flavum), Corynebacterium
glutamicum ATCC 13869 (prior genus and species: Brevibacterium lactofermentum, or Corynebacterium
lactofermentum), Corynebacterium acetoacidophilum ATCC 13870, Corynebacterium thermoaminogenes FERM 9244,
Microbacterium ammoniaphilum ATCC 16364, Pseudomonas putida, Pseudomonas sp. D-0110, and the like.
[0208] When Corynebacterium glutamicum or an analogous microorganism is used as a host, an EMF necessary
for expressing the polypeptide need not always be contained in the vector so long as the polynucleotide of the present
invention contains an EMF. When the EMF is not contained in the polynucleotide, it is necessary to prepare the EMF
separately and ligate it so as to be in operable combination. Also, when a higher expression amount or specific
expression regulation is necessary, it is necessary to ligate the EMF corresponding thereto so as to put the EMF in
operable combination with the polynucleotide. Examples of using an externally ligated EMF are disclosed in
Microbiology, 142: 1297-1309 (1996).

[0209] With regard to the method for the introduction of the recombinant vector, any method for introducing DNA into
the above-described host cells, such as a method in which a calcium ion is used (Proc. Natl. Acad. Sci. USA, 69: 2110
(1972)), a protoplast method (Japanese Published Unexamined Patent Application No. 2483942/88), the methods
described in Gene, 17: 107 (1982) and Molecular & General Genetics, 168: 111 (1979) and the like, can be used.

[0210] When yeast is used as the host cell, examples of the expression vector include pYES2 (manufactured by
Invitrogen), YEp13 (ATCC 37115), YEp24 (ATCC 37051), YCp50 (ATCC 37419), pHS19, pHS15, and the like.
[0211] Any promoter can be used so long as it can be expressed in yeast. Examples include a promoter of a gene
in the glycolytic pathway, such as hexose kinase and the like, PHO5 promoter, PGK promoter, GAP promoter, ADH
promoter, gal 1 promoter, gal 10 promoter, a heat shock protein promoter, MF α1 promoter, CUP 1 promoter, and the like.

[0212] Examples of the host cell include microorganisms belonging to the genus Saccharomyces, the genus
Schizosaccharomyces, the genus Kluyveromyces, the genus Trichosporon, the genus Schwanniomyces, the genus
Pichia, the genus Candida and the like. Specific examples include Saccharomyces cerevisiae, Schizosaccharomyces
pombe, Kluyveromyces lactis, Trichosporon pullulans, Schwanniomyces alluvius, Candida utilis and the like.
[0213] With regard to the method for the introduction of the recombinant vector, any method for introducing DNA into
yeast, such as an electroporation method (Methods. Enzymol., 194: 182 (1990)), a spheroplast method (Proc. Natl.
Acad. Sci. USA, 75: 1929 (1978)), a lithium acetate method (J. Bacteriol., 153: 163 (1983)), a method described in
Proc. Natl. Acad. Sci. USA, 75: 1929 (1978) and the like, can be used.

[0214] When animal cells are used as the host cells, examples of the expression vector include pcDNA3.1, pSinRep5
and pCEP4 (manufactured by Invitrogen), pRev-Tre (manufactured by Clontech), pAxCAwt (manufactured by Takara
Shuzo), pcDNAI and pcDM8 (manufactured by Funakoshi), pAGE107 (Japanese Published Unexamined Patent
Application No. 22979/91; Cytotechnology, 3: 133 (1990)), pAS3-3 (Japanese Published Unexamined Patent Application
No. 227075/90), pcDM8 (Nature, 329: 840 (1987)), pcDNAI/Amp (manufactured by Invitrogen), pREP4 (manufactured
by Invitrogen), pAGE103 (J. Biochem., 101: 1307 (1987)), pAGE210, and the like.

[0215] Any promoter can be used so long as it can function in animal cells. Examples include a promoter of IE
(immediate early) gene of cytomegalovirus (CMV), an early promoter of SV40, a promoter of retrovirus, a
metallothionein promoter, a heat shock promoter, SRα promoter, and the like. Also, the enhancer of the IE gene of human
CMV can be used together with the promoter.

[0216] Examples of the host cell include human Namalwa cell, monkey COS cell, Chinese hamster CHO cell,
HST5637 (Japanese Published Unexamined Patent Application No. 299/88), and the like.
[0217] The method for introduction of the recombinant vector into animal cells is not particularly limited, so long as
it is a general method for introducing DNA into animal cells, such as an electroporation method (Cytotechnology, 3:
133 (1990)), a calcium phosphate method (Japanese Published Unexamined Patent Application No. 227075/90), a
lipofection method (Proc. Natl. Acad. Sci. USA, 84: 7413 (1987)), the method described in Virology, 52: 456 (1973),
and the like.

[0218] When insect cells are used as the host cells, the polypeptide can be expressed, for example, by the method
described in Baculovirus Expression Vectors, A Laboratory Manual, W.H. Freeman and Company, New York (1992),
Bio/Technology, 6: 47 (1988), or the like.

[0219] Specifically, a recombinant gene transfer vector and baculovirus are simultaneously inserted into insect cells

an inducer can be added to the medium, if necessary.

[0240] For example, isopropyl-β-D-thiogalactopyranoside (IPTG) or the like can be added to the medium when a
microorganism transformed with a recombinant vector containing lac promoter is cultured, or indoleacrylic acid (IAA)
or the like can be added thereto when a microorganism transformed with an expression vector containing trp promoter
is cultured.

[0241] Examples of the medium used in culturing a transformant obtained using animal cells as the host cells include
RPMI 1640 medium (The Journal of the American Medical Association, 519 (1967)), Eagle's MEM medium (Science,
122: 501 (1952)), Dulbecco's modified MEM medium (Virology, 8: 396 (1959)), 199 Medium (Proceeding of the
Society for the Biological Medicine, 73: 1 (1950)), the above-described media to which fetal calf serum has been added
and the like.

[0242] The culturing is carried out generally at a pH of 6 to 8 and a temperature of 30 to 40°C in the presence of 5%
CO2 for 1 to 7 days.

[0243] Also, if necessary, antibiotics, such as kanamycin, penicillin, and the like, can be added to the medium during
the culturing.

[0244] Examples of the medium used in culturing a transformant obtained using insect cells as the host cells include
TNM-FH medium (manufactured by Pharmingen), Sf-900 II SFM (manufactured by Life Technologies), ExCell 400 and
ExCell 406 (manufactured by JRH Biosciences), Grace's Insect Medium (Nature, 195: 788 (1962)), and the like.
[0245] The culturing is carried out generally at a pH of 6 to 7 and a temperature of 25 to 30°C for 1 to 5 days.
[0246] Additionally, antibiotics, such as gentamicin and the like, can be added to the medium during the culturing, if
necessary.

[0247] A transformant obtained by using a plant cell as the host cell can be used as the cell or after differentiating
to a plant cell or organ. Examples of the medium used in the culturing of the transformant include Murashige and Skoog
(MS) medium, White medium, media to which a plant hormone, such as auxin, cytokinine, or the like has been added,
and the like.

[0248] The culturing is carried out generally at a pH of 5 to 9 and a temperature of 20 to 40°C for 3 to 60 days.
[0249] Also, antibiotics, such as kanamycin, hygromycin and the like, can be added to the medium during the
culturing, if necessary.
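
The general culturing conditions given above for animal, insect and plant cell transformants can be collected in one place for comparison. The sketch below is an illustrative summary only; the data structure and the function name are hypothetical, and the stated ranges are general ones rather than limits.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CultureRange:
    ph: tuple       # (lowest, highest) pH
    temp_c: tuple   # (lowest, highest) temperature in degrees Celsius
    days: tuple     # (shortest, longest) culturing period in days


# General ranges as stated in the text for each type of host cell.
CULTURE_RANGES = {
    "animal": CultureRange(ph=(6, 8), temp_c=(30, 40), days=(1, 7)),
    "insect": CultureRange(ph=(6, 7), temp_c=(25, 30), days=(1, 5)),
    "plant": CultureRange(ph=(5, 9), temp_c=(20, 40), days=(3, 60)),
}


def within_general_range(host, ph, temp_c, days):
    """Check whether proposed conditions fall inside the general ranges."""
    ranges = CULTURE_RANGES[host]
    checks = ((ph, ranges.ph), (temp_c, ranges.temp_c), (days, ranges.days))
    return all(low <= value <= high for value, (low, high) in checks)
```

The 5% CO2 atmosphere stated for animal cells is not modeled here.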

[0250] As described above, the polypeptide can be produced by culturing a transformant derived from a
microorganism, animal cell or plant cell containing a recombinant vector to which a DNA encoding the polypeptide of the
present invention has been inserted according to the general culturing method to produce and accumulate the
polypeptide, and recovering the polypeptide from the culture.

[0251] In addition to direct expression, the process of gene expression may include secretory production of the
encoded protein, fusion protein expression and the like in accordance with the methods described in Molecular Cloning,
2nd ed.

[0252] The method for producing the polypeptide of the present invention includes a method of intracellular
expression in a host cell, a method of extracellular secretion from a host cell, or a method of production on a host cell membrane
outer envelope. The method can be selected by changing the host cell employed or the structure of the polypeptide
produced.

[0253] When the polypeptide of the present invention is produced in a host cell or on a host cell membrane outer
envelope, the polypeptide can be positively secreted extracellularly according to, for example, the method of Paulson
et al. (J. Biol. Chem., 264: 17619 (1989)), the method of Lowe et al. (Proc. Natl. Acad. Sci. USA, 86: 8227 (1989);
Genes Develop., 4: 1288 (1990)), and/or the methods described in Japanese Published Unexamined Patent Application
No. 336963/93, WO 94/23021, and the like.

[0254] Specifically, the polypeptide of the present invention can be positively secreted extracellularly by expressing
it in a form in which a signal peptide has been added in front of a polypeptide containing an active site of the
polypeptide of the present invention according to the recombinant DNA technique.

[0255] Furthermore, the amount produced can be increased using a gene amplification system, such as by use of 
a dihydrofolate reductase gene or the like according to the method described in Japanese Published Unexamined 
Patent Application No. 227075/90. 

[0256] Moreover, the polypeptide of the present invention can be produced by a transgenic animal individual
(transgenic nonhuman animal) or plant individual (transgenic plant).

[0257] When the transformant is the animal individual or plant individual, the polypeptide of the present invention
can be produced by breeding or cultivating it so as to produce and accumulate the polypeptide, and recovering the
polypeptide from the animal individual or plant individual.

[0258] Examples of the method for producing the polypeptide of the present invention using the animal individual
include a method for producing the polypeptide of the present invention in an animal developed by inserting a gene
according to methods known to those of ordinary skill in the art (American Journal of Clinical Nutrition, 63: 639S (1996),
American Journal of Clinical Nutrition, 63: 627S (1996), Bio/Technology, 9: 830 (1991)).
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include a method for producing the polypeptide of me Pffs®"^";^™^ ^ ^ ^0 (1994), Tissue Culture. 21 

[02621 The polypeptide of the present invention can be produced a ™ai y ,3^^ another 

example, two I ^.translation methods which '^--i^^J';^,^^^^ ^n /^Wfrotran'scriptlon product. 

method using DNA as a template. The template RNAincludes^^ ^ ^^^^ 

and the like. The template DNA Includes « P'"^^;^^^-^^^^^^^^^^ To select the most suttable system 

therein and downstream of the inrt.ation site, a '=^'^-P°" synthesized (prokaryotic cell/eucaryotic 

for the in vitro translation, the origin of the gene «n~«;ng *e ^^'^ ^^^^^^^ J^the like should be consld- 

cell).thetypecfthetemplate(DNAmNA).thepu^os^^^^^^^ 

r^Mr„hr?™"rarz^^^^^^ 

S2TrJSSSansla.onofaONAnuc.o^^^^^^^^^^ 

be carried out using an '"^f^^^^f J^^^^^^^^^^ as aUplate. a linear prokaryotic 

ufactured by Promega, catalogue No. L1130). ^'^^'^^sP^^^^gon) XPL. or the like, can be carried out using 

reaction or translation reaction, and the like can be earned out u^^^^ 

[02651 The polypeptide produced by the ^^"-'X por^JiSfJSSrpt^^^^ of the present invention is 
general method for isolating and punfymg an j^^^^^^^* ^1 centrlfugatton after cultivation, suspended 

expressed as a soluble product in the host ceUs. "^^l";'^^*^^^^^ f Canton Gaulln homogenlzer. a Dynomlll. 
in^aqueousbuffer.anddlsmpted using an ultrasonto^^^^ 

or the like to obtain a cell-free extract. From the supe^^atent obtained by oen g g ^ ^ 

product can be obtained by the aene^'^e or Like ^ZSg XZ^nZ^o^u organk: solvent, anion ex- 
traction, salting out using ammonium """^^ "^.^^^.^^^^^ HPA-75 (manufactured 

such as isoelectronic focusing or the like, alone or '^'V'''"^^®" ^^"^^^^^ „„s the cells are collected in the 
[^66] When the polypeptide is expressed as '"/^'"^^^^^^^^^^ 
same manner, disrupted and centrifuged to recover me nso^ 

Next, the insoluble product of the polypeptide is '"^''"^^.^JJ^^^^^^^^^^ soLon. Thus, the nomial con- 

;-:rn:;ro:;;rer^^^^^^ 
?j2r=nrs^^^^^^ 

asugarchainthereto)l8 8ecretedout of cells, thepolypeptideoritsdenvawec^^^ treatment similar to the above (for 
and examples include a polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from
SEQ ID NOS:2 to 3431, and a polypeptide comprising an amino acid sequence represented by any one of SEQ ID
NOS:3502 to 6931.

[0269] Furthermore, a polypeptide comprising an amino acid sequence in which at least one amino acid is deleted,
replaced, inserted or added in the amino acid sequence of the polypeptide and having substantially the same activity
as that of the polypeptide is included in the scope of the present invention. The term "substantially the same activity
as that of the polypeptide" means the same activity represented by the inherent function, enzyme activity or the like
possessed by the polypeptide which has not been deleted, replaced, inserted or added. The polypeptide can be
obtained using a method for introducing part-specific mutation(s) described in, for example, Molecular Cloning, 2nd ed.,
Current Protocols in Molecular Biology, Nuc. Acids. Res., 10: 6487 (1982), Proc. Natl. Acad. Sci. USA, 79: 6409 (1982),
Gene, 34: 315 (1985), Nuc. Acids. Res., 13: 4431 (1985), Proc. Natl. Acad. Sci. USA, 82: 488 (1985) and the like. For
example, the polypeptide can be obtained by introducing mutation(s) to DNA encoding a polypeptide having the amino
acid sequence represented by any one of SEQ ID NOS:3502 to 6931. The number of the amino acids which are deleted,
replaced, inserted or added is not particularly limited; however, it is usually 1 to the order of tens, preferably 1 to 20,
more preferably 1 to 10, and most preferably 1 to 5, amino acids.

[0270] As used herein, the deletion, replacement, insertion or addition of at least one amino acid in the amino acid
sequence of the polypeptide of the present invention means that at least one amino acid is deleted, replaced, inserted
or added at one or plural positions in the amino acid sequence. The deletion, replacement, insertion or addition may
occur in the same amino acid sequence simultaneously. Also, the amino acid residue replaced, inserted or added
can be natural or non-natural. Examples of the natural amino acid residue include L-alanine, L-asparagine, L-aspartic
acid, L-glutamine, L-glutamic acid, glycine, L-histidine, L-isoleucine, L-leucine, L-lysine, L-methionine, L-phenylalanine,
L-proline, L-serine, L-threonine, L-tryptophan, L-tyrosine, L-valine, L-cysteine, and the like.

[0271] Herein, examples of amino acid residues which are replaced with each other are shown below. The amino 
acid residues in the same group can be replaced with each other. 

Group A:

[0272] leucine, isoleucine, norleucine, valine, norvaline, alanine, 2-aminobutanoic acid, methionine, O-methylserine,
t-butylglycine, t-butylalanine, cyclohexylalanine;

Group B:

[0273] aspartic acid, glutamic acid, isoaspartic acid, isoglutamic acid, 2-aminoadipic acid, 2-aminosuberic acid;

Group C:

[0274] asparagine, glutamine;

Group D:

[0275] lysine, arginine, ornithine, 2,4-diaminobutanoic acid, 2,3-diaminopropionic acid;

Group E:

[0276] proline, 3-hydroxyproline, 4-hydroxyproline;

Group F:

[0277] serine, threonine, homoserine;

Group G:

[0278] phenylalanine, tyrosine.
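
The grouping above lends itself to a simple lookup. The sketch below is an illustrative assumption about how the groups might be encoded for checking whether two residues belong to the same group; the names `SUBSTITUTION_GROUPS`, `group_of` and `interchangeable` are hypothetical.

```python
SUBSTITUTION_GROUPS = {
    "A": {"leucine", "isoleucine", "norleucine", "valine", "norvaline",
          "alanine", "2-aminobutanoic acid", "methionine", "O-methylserine",
          "t-butylglycine", "t-butylalanine", "cyclohexylalanine"},
    "B": {"aspartic acid", "glutamic acid", "isoaspartic acid",
          "isoglutamic acid", "2-aminoadipic acid", "2-aminosuberic acid"},
    "C": {"asparagine", "glutamine"},
    "D": {"lysine", "arginine", "ornithine", "2,4-diaminobutanoic acid",
          "2,3-diaminopropionic acid"},
    "E": {"proline", "3-hydroxyproline", "4-hydroxyproline"},
    "F": {"serine", "threonine", "homoserine"},
    "G": {"phenylalanine", "tyrosine"},
}


def group_of(residue):
    """Return the group letter of a residue, or None if it is not listed."""
    for letter, members in SUBSTITUTION_GROUPS.items():
        if residue in members:
            return letter
    return None


def interchangeable(residue_a, residue_b):
    """True when both residues are listed in the same group."""
    group = group_of(residue_a)
    return group is not None and group == group_of(residue_b)
```

Residues that are not listed in any group, such as glycine, are treated as having no listed replacement partner in this sketch.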

[0279] Also, in order that the resulting mutant polypeptide has substantially the same activity as that of the polypeptide
which has not been mutated, it is preferred that the mutant polypeptide has a homology of 60% or more, preferably
80% or more, and particularly preferably 95% or more, with the polypeptide which has not been mutated, when
calculated, for example, using default (initial setting) parameters by a homology searching software, such as BLAST, FASTA,
or the like.
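
The thresholds above can be illustrated with a small calculation. The sketch below is not the calculation performed by BLAST or FASTA; it assumes two sequences that have already been aligned to equal length, with "-" marking a gap, and uses plain percent identity over the alignment length as a stand-in for homology. The function names and tier labels are hypothetical.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding identical residues (gaps never match)."""
    if not aligned_a or len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(1 for x, y in zip(aligned_a, aligned_b) if x == y and x != "-")
    return 100.0 * matches / len(aligned_a)


def preference_tier(percent):
    """Classify a value against the 60%, 80% and 95% thresholds."""
    if percent >= 95:
        return "particularly preferred"
    if percent >= 80:
        return "preferred"
    if percent >= 60:
        return "acceptable"
    return "below threshold"
```

Because different programs and parameter settings define and normalize homology differently, a value obtained this way is only a rough guide.
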
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of the present Invention. from an amino add. a nucleic acid, a vitamin, a saccharide, an 

[0282] SpecificalV-atleastonecomponentsJe^^fr^^^^^ 
organic acid, and analogues thereof can be produc^^^^^^ 

;S^r ^t'Sl^synthes. pathways, ^-ecomposnion pathway and r^^^^^^^^ 

Lbstlnces such as amino acids, nudeic acids — ; using these di^er- 

organism to organism. The productivity of such a phys^lcgic^^^ 

«Ls. specifically by introducing a heterogeneous gene ^^J^J^^^'J^p^^^ by Introducing a synthase gene 
of lysine, which is one of the essential an.lno ^^^'J^^l^l^^^^^^^^^ 
derived fromabacteriumOW0 93/19190).Ws<^^a^ln^^^^^ 

synthase gene derived from Escherichia ^ ^j.^^^'^^^T'^^l^^lf^^^ni according to the present Invention can 
[0284] To produce such a physiologically actwe ^"''.«*a"^f ' polypeptide of the present 

Le cuLedSr the samemethod as employed ncuium|^^^^^ 

invention as described above. Also, the Phys'^'osl^a'? ^^^^^^^^^^ method and other known methods. 

in combination with, for example, the ion -^•'^^"S^^f^^" '"^^^^^^^^^ calcium transfection. 

[0285] Examples of methods known o one °» °f J ^^^Vn »e host Is a bacterium; and microinjection. 

fte protoplast method, the method usmg a Pl^^S^' ^""^^ ^"^^J,,^^^^^ and the method using a virus, and the 

calcium phosphate transfection. the P^^^^^^l^^'Pf^^'^f'^^^^^^^ Ce/fa/a fabo/atory manual. CoW Spring 
like. When the host is a eukaryote (Mo/ecuter C/on^gJ^^^^^^ 

Ha*ourLaborato,y Press. 1998)). Dcampesof^h^^^^^^^ 
higher eukaryotes (for example, mammals . and oensj^^^^^^^^ 

fragmentpresentlnthehostceils. it««nbe,ntegratedlntothe^ chromosome. These trans- 

into a factor (for example, a plasmid) having ^o6b6 by the ORF of the genome of 

foments are usable in producing the P°'yP^P^<^/ °' ^^^^^ and fragmeris thereof. Alternatively, they 

11. Preparation of antibody recognizing the polypeptide of the present invention
[0288] Anantibodywh^hrecognlzesmepo^spUdeo™ 

rrn^n^xr^^^^^^^^^^ 

polypeptide of the present invention.
(1) Production of polyclonal antibody 

loan A <k»a9. « the aniwn 1. P«»^» l°J"JK"p^««««™* '»™"'""»^'^'"' 

.ne. me <M aartnwrauon. a. m. « » m ffJJf.'^™^'S?L\Lgenw«»er«™irmur^ 
.en«»pl....o..h.e,«,«.»d»^»»r<™^ 

(Enzyme-linked Immunosorbent Assay (BUSA), igaKu onnm ^ • i* 

Harbor Laboratory (19B8)) or the like. K..mflnmRmmal with a sufficient antibody titer against the antigen 

K„=:rrrr=srz;rrs„.po^«».* 
[0293] Examples of the method for the isolation and purification include centrifugation, salting out by 40-60%
saturated ammonium sulfate, caprylic acid precipitation (Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory
(1988)), or chromatography using a DEAE-Sepharose column, an anion exchange column, a protein A- or G-column,
a gel filtration column, and the like, alone or in combination thereof, by methods known to those of ordinary skill in the art.

(2) Production of monoclonal antibody 
(a) Preparation of antibody-producing cell 



[0294] A rat whose serum shows a sufficient antibody titer against the partial fragment polypeptide of the polypeptide
of the present invention used for immunization is used as a supply source of antibody-producing cells.
[0295] On the 3rd to 7th day after the final administration of the antigen substance to the rat showing the antibody
titer, the spleen is excised.

[0296] The spleen is cut to pieces in MEM medium (manufactured by Nissui Pharmaceutical), loosened using a pair
of forceps, followed by centrifugation at 1,200 rpm for 5 minutes, and the resulting supernatant is discarded.
[0297] The spleen in the precipitated fraction is treated with a Tris-ammonium chloride buffer (pH 7.65) for 1 to 2
minutes to eliminate erythrocytes and washed three times with MEM medium, and the resulting spleen cells are used
as antibody-producing cells.

(b) Preparation of myeloma cells

[0298] As myeloma cells, an established cell line obtained from mouse or rat is used. Examples of useful cell lines
include those derived from a mouse, such as P3-X63Ag8-U1 (hereinafter referred to as "P3-U1") (Curr. Topics in
Microbiol. Immunol., 81: 1 (1978); Europ. J. Immunol., 6: 511 (1976)); SP2/0-Ag14 (SP-2) (Nature, 276: 269 (1978));
P3-X63-Ag8653 (653) (J. Immunol., 123: 1548 (1979)); P3-X63-Ag8 (X63) cell line (Nature, 256: 495 (1975)), and the
like, which are 8-azaguanine-resistant mouse (BALB/c) myeloma cell lines. These cell lines are subcultured in
8-azaguanine medium (medium in which, to a medium obtained by adding 1.5 mmol/l glutamine, 5 x 10^-5 mol/l
2-mercaptoethanol, 10 μg/ml gentamicin and 10% fetal calf serum (FCS) (manufactured by CSL) to RPMI-1640 medium
(hereinafter referred to as the "normal medium"), 8-azaguanine is further added at 15 μg/ml) and cultured in the normal
medium 3 or 4 days before cell fusion, and 2x10^ or more of the cells are used for the fusion.

(c) Production of hybridoma 



[0299] The antibody-producing cells obtained in (a) and the myeloma cells obtained in (b) are washed with MEM medium or PBS (disodium hydrogen phosphate: 1.83 g, sodium dihydrogen phosphate: 0.21 g, sodium chloride: 7.65 g, distilled water: 1 liter, pH: 7.2) and mixed to give a ratio of antibody-producing cells : myeloma cells = 5:1 to 10:1, followed by centrifugation at 1,200 rpm for 5 minutes, and the supernatant is discarded.

[0300] The cells in the resulting precipitated fraction are thoroughly loosened, 0.2 to 1 ml of a mixed solution of 2 g of polyethylene glycol-1000 (PEG-1000), 2 ml of MEM medium and 0.7 ml of dimethylsulfoxide (DMSO) per 10⁸ antibody-producing cells is added to the cells under stirring at 37°C, and then 1 to 2 ml of MEM medium is further added thereto several times at 1 to 2 minute intervals.

[0301] After the addition, MEM medium is added to give a total amount of 50 ml. The resulting solution is centrifuged at 900 rpm for 5 minutes, and then the supernatant is discarded. The cells in the resulting precipitated fraction are gently loosened and then gently suspended in 100 ml of HAT medium (the normal medium to which 10⁻⁴ mol/l hypoxanthine, 1.5×10⁻⁵ mol/l thymidine and 4×10⁻⁷ mol/l aminopterin have been added) by repeated drawing up into and discharging from a measuring pipette.

[0302] The suspension is poured into a 96-well culture plate at 100 μl/well and cultured at 37°C for 7 to 14 days in a 5% CO2 incubator.

[0303] After culturing, a part of the culture supernatant is recovered, and a hybridoma which specifically reacts with a partial fragment polypeptide of the polypeptide of the present invention is selected according to the enzyme immunoassay described in Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, Chapter 14 (1998) and the like.
[0304] A specific example of the enzyme immunoassay is described below.

[0305] The partial fragment polypeptide of the polypeptide of the present invention used as the antigen in the immunization is spread on a suitable plate, allowed to react with a hybridoma culture supernatant or a purified antibody obtained in (d) described below as a first antibody, and further allowed to react with an anti-rat or anti-mouse immunoglobulin antibody labeled with an enzyme, a chemiluminescent substance, a radioactive substance or the like as a second antibody, followed by a reaction suitable for the labeled substance. A hybridoma which specifically reacts with the polypeptide of the present invention is selected as a hybridoma capable of producing a monoclonal antibody of the present invention.
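The selection of reactive hybridoma wells from enzyme immunoassay readings can be illustrated with a minimal sketch. The function name, the well labels, the absorbance values and the cutoff rule (mean of the negative controls plus three standard deviations) are assumptions chosen for this example; the text does not prescribe a particular cutoff.

```python
from statistics import mean, stdev


def select_positive_wells(readings, negative_controls, n_sd=3.0):
    """Return the IDs of wells whose absorbance exceeds the cutoff.

    The cutoff is mean + n_sd * standard deviation of the negative
    control wells (an assumed, commonly used rule of thumb).
    """
    cutoff = mean(negative_controls) + n_sd * stdev(negative_controls)
    return sorted(well for well, absorbance in readings.items()
                  if absorbance > cutoff)


# Hypothetical absorbance readings for four hybridoma culture supernatants.
readings = {"A1": 0.08, "A2": 1.25, "A3": 0.11, "B1": 0.92}
# Hypothetical readings from wells without a specific first antibody.
negatives = [0.07, 0.09, 0.08, 0.10]

positive_wells = select_positive_wells(readings, negatives)
print(positive_wells)
```

Wells flagged in this way would still need to be confirmed and cloned as described in the surrounding procedure.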
(d) Preparation of monoclonal antibody 

vol. 2 (1 983) & vol. 3 (1 985); Practice anrf T^f^l^^f^^^^^^^T^ Hartl^aboratonr 
Immunosorbent Assay (ELISA), Igaku Shoin (1976); Antibodies - A [illegible]

(1988); Monoclonal Antibody Experiment Manual, Kodansha Scientific ([illegible]
Course, Vol. 5, Immunobiochemistry Research Method, Tokyo Kagaku Dojin (1986)).

mtino/.. tOft 129 (1972); J. /mmuno/..Metf.., 215 (1979)) and fteUte the preaence or absence 

rn<»ii5i Fxnressiori of the polypeptide of the preseritmveritlon, fluctuation of me expression, we piBSBi 

using the antibody of the present invention. 
12. Production and use of polypeptide array
(1) Production of polypeptide array

[0317] A polypeptide array can be produced using the polypeptide of the present invention obtained in the above [illegible]

polyacrylamide or the like; complex carbohydrates, such as agarose, sepharose, or the like; silica, a silica-based ma

. Br^ S (1974); A^nces in Experimental Medicine and Biology. 42 (1 974); U.S. Patent 4.681 .870. U.S. Patent 

[illegible] efficiently performed by adhering the polypeptide or antibody of the present invention to the solid support at a high density, though a high fixation density is not always necessary.
(2) Use of polypeptide array

[0322] A polypeptide or a compound capable of binding to and interacting with the polypeptides of the present invention adhered to the array can be identified using the polypeptide array to which the polypeptides of the present invention have been adhered as described in the above (1).

[0323] Specifically, a polypeptide or a compound capable of binding to and interacting with the polypeptides of the present invention can be identified by subjecting the polypeptides of the present invention to the following steps (i) to (iv):

(i) preparing a polypeptide array having the polypeptide of the present invention adhered thereto by the method of the above (1);

(ii) incubating the polypeptide immobilized on the polypeptide array together with at least one of a second polypeptide or compound;

(iii) detecting any complex formed between the at least one of a second polypeptide or compound and the polypeptide immobilized on the array using, for example, a label bound to the at least one of a second polypeptide or compound, or a secondary label which specifically binds to the complex or to a component of the complex after unbound material has been removed; and

(iv) analyzing the detection data.
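Step (iv) can be pictured with a short sketch that compares the label signal at each array spot with a background value. The spot names, the intensities, the background figure and the two-fold threshold are hypothetical values introduced only for illustration; the text does not specify how the detection data are analyzed.

```python
def detect_binders(spot_signals, background, min_ratio=2.0):
    """Return {spot: signal/background ratio} for spots at or above min_ratio.

    spot_signals maps a spot ID to a list of replicate label intensities.
    """
    hits = {}
    for spot, replicates in spot_signals.items():
        ratio = (sum(replicates) / len(replicates)) / background
        if ratio >= min_ratio:
            hits[spot] = round(ratio, 2)
    return hits


# Hypothetical replicate intensities for three immobilized polypeptides.
spot_signals = {
    "spot_01": [1200.0, 1100.0],
    "spot_02": [210.0, 190.0],
    "spot_03": [640.0, 560.0],
}

hits = detect_binders(spot_signals, background=200.0)
print(hits)
```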

[0324] Specific examples of the polypeptide array to which the polypeptide of the present invention has been adhered include a polypeptide array containing a solid support to which is adhered at least one of: a polypeptide containing an amino acid sequence selected from SEQ ID NOS:3502 to 7001; a polypeptide containing an amino acid sequence in which at least one amino acid is deleted, replaced, inserted or added in the amino acid sequence of the polypeptide and having substantially the same activity as that of the polypeptide; a polypeptide containing an amino acid sequence having a homology of 60% or more with the amino acid sequence of the polypeptide and having substantially the same activity as that of the polypeptide; a partial fragment polypeptide; and a peptide comprising an amino acid sequence of a part of a polypeptide.

[0325] The amount of production of a polypeptide derived from coryneform bacteria can be analyzed using a polypeptide array to which the antibody of the present invention has been adhered in the above (1).

[0326] Specifically, the expression amount of a gene derived from a mutant of coryneform bacteria can be analyzed by subjecting the gene to the following steps (i) to (iv):

(i) preparing a polypeptide array by the method of the above (1);

(ii) incubating the polypeptide array (the first antibody) together with a polypeptide derived from a mutant of coryneform bacteria;

(iii) detecting the polypeptide bound to the polypeptide immobilized on the array using a labeled second antibody of the present invention; and

(iv) analyzing the detection data.

[0327] Specific examples of the polypeptide array to which the antibody of the present invention is adhered include a polypeptide array comprising a solid support to which is adhered at least one antibody which recognizes a polypeptide comprising an amino acid sequence selected from SEQ ID NOS:3502 to 7001; a polypeptide comprising an amino acid sequence in which at least one amino acid is deleted, replaced, inserted or added in the amino acid sequence of the polypeptide and having substantially the same activity as that of the polypeptide; a polypeptide comprising an amino acid sequence having a homology of 60% or more with the amino acid sequence of the polypeptide and having substantially the same activity as that of the polypeptide; a partial fragment polypeptide; or a peptide comprising an amino acid sequence of a part of a polypeptide.

[0328] A fluctuation in the expression amount of a specific polypeptide can be monitored by using polypeptides obtained over the time course of culture as the polypeptide derived from coryneform bacteria. The culturing conditions can be optimized by analyzing the fluctuation.
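Monitoring such a fluctuation amounts to comparing the array signal of one polypeptide across sampling times. The sketch below expresses each time point as a log2 fold change relative to the first sample and reports the time of maximum signal. The sampling times and signal values are hypothetical, and the log2 representation is an assumption made for this example rather than a method stated in the text.

```python
import math


def log2_fold_changes(time_course):
    """time_course: list of (hours, signal). Returns (hours, log2 ratio to first sample)."""
    baseline = time_course[0][1]
    return [(hours, math.log2(signal / baseline)) for hours, signal in time_course]


def peak_time(time_course):
    """Return the sampling time at which the signal is highest."""
    return max(time_course, key=lambda point: point[1])[0]


# Hypothetical array signals for one polypeptide sampled during culture.
course = [(0, 100.0), (6, 200.0), (12, 800.0), (24, 400.0)]

changes = log2_fold_changes(course)
print(changes)
print(peak_time(course))
```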

[0329] When a polypeptide derived from a mutant of coryneform bacteria is used, a mutated polypeptide can be detected.

13. Identification of useful mutation in mutant by proteome analysis 

[0330] The term "proteome" is used herein to refer to a method wherein a polypeptide is separated by two-dimensional electrophoresis and the separated polypeptide is digested with an enzyme, followed by identification of the polypeptide using a mass spectrometer (MS) and searching a data base.
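The digest-and-search idea can be sketched as follows. This is an illustration under stated assumptions, not the procedure of the text: trypsin is assumed as the enzyme (cleavage after K or R unless followed by P), the residue masses are standard monoisotopic values, and the two database entries `orf_a` and `orf_b` are invented sequences.

```python
# Standard monoisotopic residue masses (Da) of the 20 common amino acids.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056


def tryptic_digest(protein):
    """Cleave after K or R, except when the next residue is P (assumed trypsin rule)."""
    peptides, start = [], 0
    for i, residue in enumerate(protein):
        next_residue = protein[i + 1] if i + 1 < len(protein) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(protein[start:i + 1])
            start = i + 1
    if start < len(protein):
        peptides.append(protein[start:])
    return peptides


def peptide_mass(peptide):
    """Monoisotopic mass of a peptide: sum of residue masses plus one water."""
    return sum(RESIDUE_MASS[residue] for residue in peptide) + WATER


def count_matches(observed_masses, protein, tolerance=0.01):
    """Count observed masses that match a theoretical tryptic peptide mass."""
    theoretical = [peptide_mass(p) for p in tryptic_digest(protein)]
    return sum(any(abs(m - t) <= tolerance for t in theoretical)
               for m in observed_masses)


def best_match(observed_masses, database):
    """Return the database entry whose digest explains the most observed masses."""
    return max(database, key=lambda name: count_matches(observed_masses, database[name]))


# Hypothetical amino acid sequence database (invented sequences).
database = {"orf_a": "AGKCDRPEFKLM", "orf_b": "MSTNPKPQR"}
# Pretend these two peptide masses were measured from an excised spot.
observed = [peptide_mass("AGK"), peptide_mass("LM")]

best = best_match(observed, database)
print(tryptic_digest(database["orf_a"]), best)
```

A real search would also account for charge states, modifications and missed cleavages, which this sketch leaves out.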

[0331] Two-dimensional electrophoresis means an electrophoretic method which is performed by combining two electrophoretic procedures having different principles. For example, polypeptides are separated depending on molecular weight in the primary electrophoresis. Next, the gel is rotated by 90° or 180° and the secondary electrophoresis is carried out depending on isoelectric point. Thus, various separation patterns can be achieved (JIS K 3600 2474).
[0332] In searching the data base, the amino acid sequence information of the [illegible] and the recording medium of the present invention provided for in the above items 2 and 8 [illegible]
[0333] The proteome analysis of a coryneform bacterium and its mutant makes it possible to identify a polypeptide

[illegible] wild type strain of coryneform bacteria and a [illegible]
[illegible] production of a target product makes it possible to efficiently identify a mutation [illegible]
[illegible] for improving the productivity of a target product or a protein of which expression amount is fluctuated [illegible]
[illegible]

[illegible] a spot increased in the lysine-producing strain, compared with the wild-type strain, is cut out and a data base is searched so that a polypeptide showing an increase in yield [illegible] the productivity can be identified. For example, as a result of the proteome analysis on a wild type strain and a lysine-producing strain, the productivity of the catalase having the amino acid sequence represented by SEQ ID NO. [illegible]

sequence information and the amino acid sequence information of the genome of the coryneform bacteria of the present invention, and a recording medium storing the sequences, the nucleotide sequence of [illegible] and the nucleotide sequence in the upstream thereof can be se[illegible] sequence having a high expression promoter can be efficiently selected.

[illegible] In the proteome analysis, a spot on the two-dimensional electrophoresis [illegible]. However, the modified protein can be efficiently identified using the recording medium storing the nucleotide sequence information and the amino acid sequence information of the genome of coryneform bacteria, and the recording medium storing the sequences, according to the present invention.

Moreover, a useful mutation point in a useful mutant can be easily specified by searching a nucleotide sequence (nucleotide sequence of promoters, ORF, or the like) relating to the thus identified protein using a recording medium storing the nucleotide sequence information and the amino acid sequence information of the genome of coryneform bacteria of the present invention, and a recording medium storing the sequences, and using [illegible] designed on the basis of the detected nucleotide sequence. As a result that the useful mutation point is [illegible] useful mutant having the useful mutation or other useful mutation derived [illegible]

The present invention will be explained in detail below based on Examples. However, the present invention is not limited thereto.

Example 1 

Determination of the full nucleotide sequence of genome of Corynebacterium glutamicum

[0340] The full nucleotide sequence of the genome of Corynebacterium glutamicum was determined based on the whole genome shotgun method (Science, 269: 496-512 (1995)). In this method, a genome library was prepared and the terminal sequences were determined at random. Subsequently, these sequences were ligated on a computer to cover the full genome. Specifically, the following procedure was carried out.
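The idea of joining randomly determined sequences on a computer can be shown with a toy greedy overlap merge. This is only a conceptual sketch: the reads are invented, exact overlaps are assumed, and it is not the algorithm of the assembly software used in the Example.

```python
def overlap(a, b, min_len):
    """Length of the longest suffix of a that equals a prefix of b (0 if shorter than min_len)."""
    for n in range(min(len(a), len(b)), min_len - 1, -1):
        if a.endswith(b[:n]):
            return n
    return 0


def greedy_assemble(reads, min_len=3):
    """Repeatedly merge the pair of sequences with the longest exact overlap."""
    contigs = list(dict.fromkeys(reads))  # drop exact duplicates, keep order
    # Drop reads that are wholly contained in another read.
    contigs = [r for r in contigs if not any(r != o and r in o for o in contigs)]
    while True:
        best = (0, None, None)
        for i, a in enumerate(contigs):
            for j, b in enumerate(contigs):
                if i != j:
                    n = overlap(a, b, min_len)
                    if n > best[0]:
                        best = (n, i, j)
        n, i, j = best
        if n == 0:
            return contigs  # no remaining overlaps: these are the final contigs
        merged = contigs[i] + contigs[j][n:]
        contigs = [c for k, c in enumerate(contigs) if k not in (i, j)] + [merged]


# Invented reads sampled from the toy sequence ATGCGTACGTTAGC.
reads = ["GTACGTTA", "ATGCGTAC", "GTTAGC"]
contigs = greedy_assemble(reads)
print(contigs)
```

Real reads contain errors and repeats, so production assemblers use quality values and inexact alignment rather than exact string overlaps.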

(1) Preparation of genome DNA of Corynebacterium glutamicum ATCC 13032

[0341] Corynebacterium glutamicum ATCC 13032 was cultured in BY medium (7 g/l meat extract, [illegible] sodium chloride, 5 g/l yeast extract, pH 7.2) containing 1% of glycine at [illegible] by centrifugation. After washing with STE buffer (10.3% sucrose, 25 mmol/l Tris hydrochloride, 25 mmol/l EDTA, pH 8.0), the cells were suspended in 10 ml of STE buffer containing 10 mg/ml lysozyme, followed by gently shaking at 37°C for 1 hour. Then, 2 ml [illegible] 65°C for 10 minutes and then cooled to room temperature. Then, [illegible] followed by gently shaking at room temperature for 30 minutes and centrifugation (15,000 × g, 20 minutes, 20°C). The aqueous layer was separated and subjected to extraction with phenol/chloroform and extraction [illegible] in the same manner. To the aqueous layer, 3 mol/l sodium acetate solution (pH 5.2) and isopropanol were added at 1/10 volume and twice volume, respectively, followed by gently stirring to precipitate the genome DNA. The genome DNA was dissolved again in 3 ml of TE buffer (10 mmol/l Tris hydrochloride, 1 mmol/l EDTA, pH 8.0) containing 0.02 mg/ml of RNase and maintained at 37°C for 45 minutes. The extractions with phenol, phenol/chloroform and chloroform were carried out successively in the same manner as the above. The genome DNA was subjected to isopropanol precipitation. The thus formed genome DNA precipitate was washed with 70% ethanol three times, followed by air-drying, and dissolved in 1.25 ml of TE buffer to give a genome DNA solution (concentration: 0.1 mg/ml).

(2) Construction of a shotgun library 


[0342] TE buffer was added to 0.01 mg of the thus prepared genome DNA of Corynebacterium glutamicum ATCC 13032 to give a total volume of 0.4 ml, and the mixture was treated with a sonicator (Yamato Powersonic Model 150) at an output of 20 continuously for 5 seconds to obtain fragments of 1 to 10 kb. The genome fragments were blunt-ended using a DNA blunting kit (manufactured by Takara Shuzo) and then fractionated by 6% polyacrylamide gel electrophoresis. Genome fragments of 1 to 2 kb were cut out from the gel, and 0.3 ml of MG elution buffer (0.5 mol/l ammonium acetate, 10 mmol/l magnesium acetate, 1 mmol/l EDTA, 0.1% SDS) was added thereto, followed by shaking at 37°C overnight to elute DNA. The DNA eluate was treated with phenol/chloroform, and then precipitated with ethanol to obtain a genome library insert. The total insert and 500 ng of pUC18 SmaI/BAP (manufactured by Amersham Pharmacia Biotech) were ligated at 16°C for 40 hours.

[0343] The ligation product was precipitated with ethanol and dissolved in 0.01 ml of TE buffer. The ligation solution (0.001 ml) was introduced into 0.04 ml of E. coli ELECTRO MAX DH10B (manufactured by Life Technologies) by electroporation under conditions according to the manufacturer's instructions. The mixture was spread on LB plate medium (LB medium (10 g/l bactotrypton, 5 g/l yeast extract, 10 g/l sodium chloride, pH 7.0) containing 1.6% of agar) containing 0.1 mg/ml ampicillin, 0.1 mg/ml X-gal and 1 mmol/l isopropyl-β-D-thiogalactopyranoside (IPTG) and cultured at 37°C overnight.

[0344] The transformant obtained from colonies formed on the plate medium was stationarily cultured in a 96-well titer plate having 0.05 ml of LB medium containing 0.1 mg/ml ampicillin at 37°C overnight. Then, 0.05 ml of LB medium containing 20% glycerol was added thereto, followed by stirring to obtain a glycerol stock.

(3) Construction of cosmid library

[0345] About 0.1 mg of the genome DNA of Corynebacterium glutamicum ATCC 13032 was partially digested with Sau3AI (manufactured by Takara Shuzo) and then ultracentrifuged (26,000 rpm, 18 hours, 20°C) under a 10 to 40% sucrose density gradient obtained using 10% and 40% sucrose buffers (1 mol/l NaCl, 20 mmol/l Tris hydrochloride, 5 mmol/l EDTA, 10% or 40% sucrose, pH 8.0). After the centrifugation, the solution thus separated was fractionated into tubes at 1 ml in each tube. After confirming the DNA fragment length of each fraction by agarose gel electrophoresis, a fraction containing a large amount of DNA fragment of about 40 kb was precipitated with ethanol.
[0346] The DNA fragment was ligated to the BamHI site of superCos1 (manufactured by Stratagene) in accordance with the manufacturer's instructions. The ligation product was incorporated into Escherichia coli XL-1-BlueMR strain (manufactured by Stratagene) using Gigapack III Gold Packaging Extract (manufactured by Stratagene) in accordance with the manufacturer's instructions. The Escherichia coli was spread on LB plate medium containing 0.1 mg/ml ampicillin and cultured at 37°C overnight to isolate colonies. The resulting colonies were stationarily cultured at 37°C overnight in a 96-well titer plate containing 0.05 ml of the LB medium containing 0.1 mg/ml ampicillin in each well. LB medium containing 20% glycerol (0.05 ml) was added thereto, followed by stirring to obtain a glycerol stock.


(4) Determination of nucleotide sequence
(4-1) Preparation of template

[0347] The full nucleotide sequence of Corynebacterium glutamicum ATCC 13032 was determined mainly based on the whole genome shotgun method. The template used in the whole genome shotgun method was prepared by the PCR method using the library prepared in the above (2).

[0348] Specifically, the clone derived from the whole genome shotgun library was inoculated using a replicator (manufactured by GENETIX) into each well of a 96-well plate containing the LB medium containing 0.1 mg/ml of ampicillin at 0.08 ml per each well and then stationarily cultured at 37°C overnight.

[0349] Next, the culturing solution was transported using a copy plate (manufactured by Tokken) into a 96-well reaction plate (manufactured by PE Biosystems) containing a PCR reaction solution (TaKaRa Ex Taq (manufactured by Takara Shuzo)) at 0.08 ml per each well. Then, PCR was carried out in accordance with the protocol by Makino et al. (DNA Research, 5: 1-9 (1998)) using GeneAmp PCR System 9700 (manufactured by PE Biosystems) to amplify the inserted fragment.

[0350] The excessive primers and nucleotides were eliminated using a kit for purifying a PCR product (manufactured by Amersham Pharmacia Biotech) and the residue was used as the template in the sequencing reaction.
[0351] Some nucleotide sequences were determined using a double-stranded DNA plasmid as a template.
[0352] The double-stranded DNA plasmid as the template was obtained by the following method.
[0353] The clone derived from the whole genome shotgun library was inoculated into a 24- or 96-well plate containing [illegible] at 1.5 ml per each well and then cultured under shaking at 37°C overnight.
[0354] The double-stranded DNA plasmid was prepared from the culturing solution using [illegible] preparing machine, KURABO PI-50 (manufactured by Kurabo Industries) or a MultiScreen (manufactured by Millipore) in [illegible] mg/ml and used as the template in sequencing.
(4-2) Sequencing reaction 

[0357] To 6 μl of a solution of ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured by PE Biosystems), an M13 regular direction primer (M13-21) or an M13 reverse direction primer ([illegible]) (DNA Research, 5: 1-9 (1998)) and the template prepared in the above (4-1) (the PCR product or the plasmid) were added to give [illegible] of a sequencing reaction solution. The primers and the templates were used in an amount of 1.6 pmol [illegible]

[0358] A [illegible] reaction of 45 cycles was carried out with GeneAmp PCR System 9700 (manufactured by PE Biosystems) using the reaction solution. The cycle parameter was determined in accordance with the manufacturer's instructions accompanying ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit. The sample was purified using MultiScreen HV plate (manufactured by Millipore) according to the manufacturer's instructions. The thus purified reaction product was precipitated with ethanol, followed by drying, and then stored in the dark [illegible]

[0359] The dry reaction product was analyzed by ABI PRISM 377 DNA Sequencer and ABI PRISM 3700 DNA Analyzer (both manufactured by PE Biosystems), each in accordance with the manufacturer's instructions.
[0360] The data of about 50,000 sequences ([illegible] and about 8,000 reactions obtained by 3700 DNA Analyzer) were transferred to a [illegible] (manufactured by COMPAQ) and stored. The data of these about 50,000 sequences corresponded to 6 times as much as the genome size.
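The six-fold figure can be related to the expected completeness of a random shotgun with the idealized Lander-Waterman model. The sketch below is an approximation under stated assumptions: reads are taken to be uniformly distributed and the minimum detectable overlap is ignored. The read length and genome size passed to `coverage` are hypothetical round numbers, since neither value is given in this passage.

```python
import math


def coverage(n_reads, mean_read_len, genome_size):
    """Average sequencing depth c = N * L / G."""
    return n_reads * mean_read_len / genome_size


def expected_uncovered_fraction(c):
    """Idealized probability that a given base is covered by no read: e^(-c)."""
    return math.exp(-c)


def expected_contigs(n_reads, c):
    """Idealized expected number of contigs (and hence gaps): N * e^(-c)."""
    return n_reads * math.exp(-c)


# Hypothetical read length and genome size, for illustration only.
c_example = coverage(n_reads=50_000, mean_read_len=400, genome_size=3_300_000)
print(round(c_example, 2))

# Using the six-fold redundancy and the read count stated in the text.
print(expected_uncovered_fraction(6.0))
print(expected_contigs(50_000, 6.0))
```

Even at six-fold redundancy the model predicts on the order of a hundred gaps, which is consistent with a separate gap-closing step being needed.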

(5) Assembly 

[0361] All operations were carried out on the basis of UNIX platform. The analytical data were [illegible] platform using X Window System. The base call was carried out using phred (The University of Washington). The vector sequence data was deleted using SPS Cross_Match (manufactured by Southwest Parallel Software). The assembly was carried out using SPS phrap (manufactured by Southwest Parallel Software; a [illegible] (The University of Washington)). The contig obtained by the assembly was analyzed using [illegible] consed (The University of Washington). A series of the operations from the base call to the assembly was carried out simultaneously using a script phredPhrap attached to consed.
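Base callers of this kind attach a quality value Q to each base, defined by Q = -10 log10(P), where P is the estimated error probability. The sketch below converts Q to P and trims low-quality bases from both ends of a read. The trimming rule and the threshold of 20 are assumptions chosen for illustration and are not the trimming procedure of the software named above.

```python
def error_probability(q):
    """Convert a phred-style quality value to an error probability: P = 10^(-Q/10)."""
    return 10 ** (-q / 10)


def trim_low_quality(seq, quals, min_q=20):
    """Remove bases from both ends until a base with quality >= min_q is reached."""
    start = 0
    while start < len(quals) and quals[start] < min_q:
        start += 1
    end = len(quals)
    while end > start and quals[end - 1] < min_q:
        end -= 1
    return seq[start:end]


# Hypothetical read with per-base quality values.
read = "ACGTACGT"
quals = [5, 10, 30, 35, 40, 30, 12, 8]

trimmed = trim_low_quality(read, quals)
print(trimmed, error_probability(20))
```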

(6) Determination of nucleotide sequence in gap part 

[0362] Each cosmid in the cosmid library constructed in the above (3) was prepared by a [illegible] preparation of the double-stranded DNA plasmid described in the above (4-1). The nucleotide sequence at the end of the inserted fragment of the cosmid was determined by using ABI PRISM BigDye Terminator Cycle Sequencing Ready Reaction Kit (manufactured by PE Biosystems) according to the manufacturer's instructions.
[0363] About 800 cosmid clones were sequenced at both ends to search for a nucleotide sequence in the contigs derived from the shotgun sequencing obtained in the above (5) coincident with the sequence. Thus, the linkage between respective cosmid clones and respective contigs was determined and mutual [illegible] the results were compared with the physical map of Corynebacterium glutamicum ATCC 13032 (Mol. Gen. Genet., 252: 255-265 (1996)) to carry out mapping between the cosmids and the contigs.
[0364] The sequence in the region which was not covered with the contigs was determined by the following method.
[0365] Clones containing sequences positioned at the ends of contigs were selected. Among these, [illegible] 1,000 clones wherein only one end of the inserted fragment had been determined were selected and the sequence at the opposite end of the inserted fragment was determined. A shotgun library clone or a cosmid clone containing the sequences at the respective ends of the inserted fragment in two contigs was identified, the full nucleotide sequence of the inserted fragment of this clone was determined, and thus the nucleotide sequence of the gap part was determined. When no shotgun library clone or cosmid clone covering the gap part was available, primers complementary to the end sequences at the two contigs were prepared and the DNA fragment in the gap part was amplified by PCR. Then, sequencing was performed by the primer walking method using the amplified DNA fragment as a template or by the shotgun method in which the sequence of a shotgun clone prepared from the amplified DNA fragment was determined. Thus, the nucleotide sequence of the domain was determined.
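The search for a clone whose two insert ends fall in different contigs can be sketched as a simple grouping step. The clone and contig names below are hypothetical, and the input is assumed to be the already computed contig assignment of each end sequence (with None where an end matched no contig).

```python
def bridging_clones(end_hits):
    """Group clones by the pair of distinct contigs that their two insert ends hit.

    end_hits maps a clone name to (contig of one end, contig of the other end).
    Clones with an unplaced end, or with both ends in one contig, are skipped.
    """
    links = {}
    for clone, (left, right) in sorted(end_hits.items()):
        if left is None or right is None or left == right:
            continue
        pair = tuple(sorted((left, right)))
        links.setdefault(pair, []).append(clone)
    return links


# Hypothetical end-sequence placements for four clones.
end_hits = {
    "cos_001": ("contig_1", "contig_2"),
    "cos_002": ("contig_2", "contig_2"),
    "cos_003": ("contig_3", None),
    "cos_004": ("contig_2", "contig_1"),
}

links = bridging_clones(end_hits)
print(links)
```

Each key in the result names a gap between two contigs, and the listed clones are candidates for full insert sequencing to close that gap.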

[0366] In a region showing a low sequence precision, primers were synthesized using the AUTOFINISH function and the NAVIGATING function of consed (The University of Washington) and the sequence was determined by the primer walking method to improve the sequence precision. The thus determined full nucleotide sequence of the genome of Corynebacterium glutamicum ATCC 13032 strain is shown in SEQ ID NO:1.

(7) Identification of ORF and presumption of its function 

[0367] ORFs in the nucleotide sequence represented by SEQ ID NO:1 were identified according to the following method. First, the ORF regions were determined using software for identifying ORF, i.e., Glimmer, GeneMark and GeneMark.hmm on UNIX platform according to the respective manual attached to the software.
[0368] Based on the data thus obtained, ORFs in the nucleotide sequence represented by SEQ ID NO:1 were identified.
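To make the notion of an ORF concrete, the following naive scan reports stretches that run from a start codon to the first in-frame stop codon on either strand. It is a teaching sketch only and does not reproduce the statistical models of the gene-finding software named above; the test sequences are invented, and the set of start codons is a parameter because bacterial genes do not always begin with ATG.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def find_orfs(seq, start_codons=("ATG",), min_codons=3):
    """Return (strand, start, end, orf) tuples, stop codon included.

    Coordinates are 0-based and refer to the scanned strand, so minus-strand
    coordinates are positions on the reverse complement.
    """
    orfs = []
    for strand, s in (("+", seq), ("-", reverse_complement(seq))):
        for frame in range(3):
            start = None
            for i in range(frame, len(s) - 2, 3):
                codon = s[i:i + 3]
                if start is None and codon in start_codons:
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    if (i + 3 - start) // 3 >= min_codons:
                        orfs.append((strand, start, i + 3, s[start:i + 3]))
                    start = None
    return orfs


# Invented test sequences.
orfs = find_orfs("CCATGAAATTTTAGGG")
orfs_ttg = find_orfs("TTGAAACCCTAA", start_codons=("ATG", "TTG"))
print(orfs)
print(orfs_ttg)
```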

[0369] The putative function of an ORF was determined by searching the homology of the identified amino acid sequence of the ORF against an amino acid database consisting of protein-encoding domains derived from Swiss-Prot, PIR or the Genpept database (constituted by protein-encoding domains derived from the GenBank database) using Frame Search (manufactured by Compugen), or by performing the same homology search against these databases using BLAST. The nucleotide sequences of the thus determined ORFs are shown in SEQ ID NOS:2 to 3501, and the amino acid sequences encoded by these ORFs are shown in SEQ ID NOS:3502 to 7001.

[0370] In some cases of the sequence listings in the present invention, nucleotide sequences, such as TTG, TGT, GGT, and the like, other than ATG, are read as an initiating codon encoding Met.

[0371] Also, the preferred nucleotide sequences are SEQ ID NOS:2 to 355 and 357 to 3501, and the preferred amino acid sequences are shown in SEQ ID NOS:3502 to 3855 and 3857 to 7001.
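The parallel ranges above suggest a fixed relationship between the two sets of sequence identifiers. The sketch below assumes that the ORF of SEQ ID NO:n encodes the amino acid sequence of SEQ ID NO:n+3500; this pairing is inferred from the matching ranges (2 to 3501 against 3502 to 7001, and the exclusion of 356 against 3856) and is not stated explicitly in the text. The function names are hypothetical.

```python
ORF_NT_IDS = range(2, 3502)      # SEQ ID NOS:2 to 3501 (ORF nucleotide sequences)
ORF_AA_IDS = range(3502, 7002)   # SEQ ID NOS:3502 to 7001 (amino acid sequences)
ASSUMED_OFFSET = 3500            # assumption: amino acid ID = nucleotide ID + 3500
NON_PREFERRED_NT_ID = 356        # the only nucleotide ID outside the preferred ranges


def amino_acid_seq_id(nt_seq_id):
    """Map an ORF nucleotide SEQ ID NO to its presumed amino acid SEQ ID NO."""
    if nt_seq_id not in ORF_NT_IDS:
        raise ValueError("not an ORF nucleotide SEQ ID NO")
    return nt_seq_id + ASSUMED_OFFSET


def is_preferred(nt_seq_id):
    """True for SEQ ID NOS:2 to 355 and 357 to 3501."""
    return nt_seq_id in ORF_NT_IDS and nt_seq_id != NON_PREFERRED_NT_ID


print(amino_acid_seq_id(2), amino_acid_seq_id(3501), is_preferred(356))
```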

[0372] Table 1 shows the registration numbers in the above-described databases of the sequences which were judged as having the highest homology with the nucleotide sequences of the ORFs as the results of the homology search in the amino acid sequences using the homology-searching software Frame Search (manufactured by Compugen), the names of the genes of these sequences, the functions of the genes, and the matched lengths, identities and analogies compared with publicly known amino acid translation sequences. Moreover, the corresponding positions were confirmed via the alignment of the nucleotide sequence of an arbitrary ORF with the nucleotide sequence of SEQ ID NO:1. Also, the positions of nucleotide sequences other than the ORFs (for example, ribosomal RNA genes, transfer RNA genes, IS sequences, and the like) on the genome were determined.

[0373] Fig. 1 shows the positions of typical genes of Corynebacterium glutamicum ATCC 13032 on the genome.




Function 


hypothetical membrane protein 


2.5-diketo-D-gluconic acid reductase 


5'-nucleotidase precursor


5'-nucleotidase family protein


transposase 


organic hydroperoxide detoxication 
enzyme 


ATP-dependent DNA helicase
lipoprotein


ABC 3 transport family or integral
membrane protein


iron(III) dicitrate transport ATP-
binding protein


sugar ABC transporter, periplasmic 
sugar-binding protein 


high affinity ribose transport protein 


ribose transport ATP-binding protein


neurofilament subunit NF-160


peptidyl-prolyl cis-trans isomerase A


hypothetical membrane protein 
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63.7 


74.1 


70.3 


56.5 


66.3 


76.7 
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Identity 
(%) 


1 24.9 


65.4 


27.0 


27.0 


52.9 


51.8 


32.7 




26.7 


28.9 


34.6 


39.2 


25.8 


30.5 


32.2 


23.6 
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29.2 
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Table 1 (continued) 


Homologous gene 


Mycobacterium leprae 
MLCB1788.18 


Corynebacterium sp. ATCC 
31090 


Vibrio parahaemolyticus nutA 


Deinococcus radiodurans 
DR0505 


Corynebacterium striatum ORF1


Xanthomonas campestris 
Thiobacillus ferrooxidans recG




Saccharomyces cerevisiae 
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TM0114 
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Petromyzon marinus 
Bacillus subtilis 168 yqgP
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Function 


hypothetical protein


hypothetical protein 


hypothetical protein 




hypothetical protein 






magnesium and cobalt transport 
protein 




chloride channel protein 1 


required for NMN transport 


phosphate starvation-induced 
protein-like protein 








Mg(2+)/citrate complex secondary
transporter 
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histidine kinase 




transcriptional regulator 


D-isomer specific 2-hydroxyacid
dehydrogenase 
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Table 1 (continued) 


Homologous gene 
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hypothetical protein 
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5*-phosphoribosyl-4-N- 
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peptide synthase 




phenylacetaldehyde dehydrogenase 


hypothetical protein 


hypothetical protein 
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insertion element (IS3 related) 


insertion element (IS3 related) 


two-component system sensor 
histidine kinase 

transcriptional regulator 
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hypothetical protein 
Table 1 (continued)


Homologous gene


Corynebacterium glutamicum
orf2


Corynebacterium glutamicum
orf1
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ammoniagenes purA 
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Table 1 (continued) 


Homologous gene


Streptomyces coelicolor A3(2) 
SC6G4.33 








Streptomyces lavendulae 
ORF372 


Saccharomyces cerevisiae 
S288C YIR019C stal 




Bacillus subtilis glpQ


Bacillus subtilis gntP 






Corynebacterium glutamicum
AS019 pyk


Brevibacterium flavum lctA


Mycobacterium tuberculosis
H37Rv Rv1069c


Streptomyces coelicolor A3(2) 
SC1C2.30 


Brevibacterium linens ORF1
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Escherichia coli K12 MG1655 
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Function 


L-lactate dehydrogenase or FMN- 
dependent dehydrogenase 




immunity repressor protein 






phosphatase or reverse
transcriptase (RNA-dependent)


peptidase or IAA-amino acid
hydrolase 




peptide methionine sulfoxide 
reductase 


superoxide dismutase (Fe/Mn) 


transcriptional regulator 


multidrug resistance transporter 








hypothetical protein 


membrane transport protein 


transcriptional regulator 
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two-component system sensor 
histidine kinase 


hypothetical protein 


hypothetical protein 


stage III sporulation protein 


transcriptional repressor 


transglycosylase-associated protein


hypothetical protein 


hypothetical protein 
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Function 












methyltransferase 


nodulin 21-related protein








transposon Tn501 resolvase




ferredoxin precursor 


hypothetical protein 


transposase 


transposase protein fragment 
TnpNC 




glyceraldehyde-3-phosphate
dehydrogenase (pseudogene)


lipoprotein 


copper/potassium-transporting 
ATPase B or cation transporting 
Table 1 (continued)


Homologous gene 












Streptomyces coelicolor A3(2) 
SCD35.11C 
Pseudomonas aeruginosa TNP5




Saccharopolyspora erythraea fer 
Function


transposase (IS1626) 


thioredoxin 
transmembrane transport protein or
4-hydroxybenzoate transporter 




hypothetical protein 


replicative DNA helicase


50S ribosomal protein L9


single-strand DNA binding protein


30S ribosomal protein S6




hypothetical protein 




penicillin-binding protein 


hypothetical protein 


bacterial regulatory protein, marR 
Table 1 (continued)


Homologous gene
salicylate hydroxylase


proton/glutamate symporter or 
excitatory amino acid transporter2 
anthranilate synthase component 1
Function


maleylacetate reductase 


sugar transporter or D-xylose-proton
symporter (D-xylose transporter)


bacterial transcriptional regulator or 
acetate operon repressor 


oxidoreductase 


diagnostic fragment protein 
myo-inositol 2-dehydrogenase


dehydrogenase or myo-inositol 2- 
dehydrogenase or streptomycin 
biosynthesis protein 
heavy-metal-associated domain
containing protein 


ectoine/proline uptake protein 


Matched length (aa)

[column values illegible in source; only 1242 and 1660 are legible]



Similarity 
(%) 


75.5 


58.3 


60.7 


55.7 


58.2 


59.6 


CN 
CO 


62.7 








57.3 




80.2 


61.0 




76.8 


70.1 


62.3 


Identity 
(%) 


43.0 


31.4 


25.7 


27.2 


25.9 


26.5 


34.1 


33.3 








26.6 




56.4 


34.6 




50.4 


46.3 


29.9 


Homologous gene 


Pseudomonas sp. P51 


Escherichia coli K12 xylE


Salmonella typhimurium iclR
Table 1 (continued)
diaminopimelate (DAP)
decarboxylase (meso-
diaminopimelate decarboxylase)
lysine exporter protein
Example 2

Determination of effective mutation site 

(1) Identification of mutation sites based on comparison of the gene nucleotide sequences of the lysine-producing B-6
strain with those of the wild type strain ATCC 13032

[0374] Corynebacterium glutamicum B-6, which is resistant to S-(2-aminoethyl)cysteine (AEC), rifampicin,
streptomycin and 6-azauracil, is a lysine-producing mutant that was bred by subjecting the wild type ATCC 13032
strain to multiple rounds of random mutagenesis with a mutagen, N-methyl-N'-nitro-N-nitrosoguanidine (NTG),
and screening (Appl. Microbiol. Biotechnol., 32: 269-273 (1989)). First, the nucleotide sequences of genes derived
from the B-6 strain and considered to relate to lysine production were determined by a method similar to the above.
The genes relating to lysine production include lysE and lysG, which are lysine-excreting genes; ddh, dapA, hom
and lysC (encoding diaminopimelate dehydrogenase, dihydrodipicolinate synthase, homoserine dehydrogenase and
aspartokinase, respectively), which are lysine-biosynthetic genes; and pyc and zwf (encoding pyruvate carboxylase
and glucose-6-phosphate dehydrogenase, respectively), which are glucose-metabolizing genes. The nucleotide
sequences of the genes derived from the production strain were compared with the corresponding nucleotide sequences
of the ATCC 13032 strain genome represented by SEQ ID NOS:1 to 3501 and analyzed. As a result, mutation points
were observed in many genes. For example, no mutation site was observed in lysE, lysG, ddh, dapA, and the like,
whereas amino acid replacement mutations were found in hom, lysC, pyc, zwf, and the like. Among these mutation
points, those considered to contribute to production were extracted on the basis of known biochemical
or genetic information. Among the mutation points thus extracted, a mutation, Val59Ala, in hom and a mutation,
Pro458Ser, in pyc were evaluated as to whether or not they were effective according to the following method.
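
The comparison described above, in which a gene sequence from the production strain is checked against the wild type sequence and amino acid replacements are reported, can be sketched as follows. This is a minimal illustration only: the function names are hypothetical, the two sequences are toy constructs rather than the actual hom sequences, the standard genetic code is assumed, and the sequences are assumed to be equal-length coding regions without insertions or deletions.

```python
# Standard genetic code laid out in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
THREE_LETTER = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
}


def translate(cds):
    """Translate a coding sequence codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        residue = CODON_TABLE[cds[pos:pos + 3].upper()]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def substitutions(wild_cds, mutant_cds):
    """Return amino acid replacements in the notation used in the text, e.g. 'Val59Ala'."""
    wild, mutant = translate(wild_cds), translate(mutant_cds)
    return [
        f"{THREE_LETTER[w]}{n}{THREE_LETTER[m]}"
        for n, (w, m) in enumerate(zip(wild, mutant), start=1)
        if w != m
    ]


# Toy sequences (hypothetical): Met, 57 x Ala, then Val (GTT) or Ala (GCT) at position 59.
wild_toy = "ATG" + "GCA" * 57 + "GTT" + "TAA"
mutant_toy = "ATG" + "GCA" * 57 + "GCT" + "TAA"
print(substitutions(wild_toy, mutant_toy))
```

A silent nucleotide change would produce no entry in the returned list, which matches the text's focus on amino acid replacement mutations.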

(2) Evaluation of mutation, Val59Ala, in hom and mutation, Pro458Ser, in pyc

[0375] It is known that a mutation in hom inducing requirement or partial requirement for homoserine imparts lysine
productivity to a wild type strain (Amino Acid Fermentation, ed. by Hiroshi Aida et al., Japan Scientific Societies Press).
However, the relationship between the mutation, Val59Ala, in hom and lysine production is not known. Whether or not
the mutation, Val59Ala, in hom is an effective mutation can be examined by introducing the mutation into the wild
type strain and examining the lysine productivity of the resulting strain. On the other hand, whether or not the
mutation, Pro458Ser, in pyc is effective can be examined by introducing this mutation into a lysine-producing strain
which has a deregulated lysine-biosynthetic pathway and is free from the pyc mutation, and comparing the lysine
productivity of the resulting strain with that of the parent strain. As such a lysine-producing bacterium, No. 58 strain
(FERM BP-7134) was selected (hereinafter referred to as the "lysine-producing No. 58 strain" or the "No. 58 strain").
Based on the above, it was decided to introduce the mutation, Val59Ala, in hom and the mutation, Pro458Ser, in pyc
into the wild type strain of Corynebacterium glutamicum ATCC 13032 (hereinafter referred to as the "wild type ATCC
13032 strain" or the "ATCC 13032 strain") and the lysine-producing No. 58 strain, respectively, using the gene
replacement method. A plasmid vector pCES30 for the gene replacement was constructed by the following method.

[0376] A plasmid vector pCE53 having a kanamycin-resistant gene and being capable of autonomously replicating
in coryneform bacteria (Mol. Gen. Genet., 196: 175-178 (1984)) and a plasmid pMOB3 (ATCC 77282) containing a
levansucrase gene (sacB) of Bacillus subtilis (Molecular Microbiology, 6: 1195-1204 (1992)) were each digested with
PstI. Then, after agarose gel electrophoresis, a pCE53 fragment and a 2.6 kb DNA fragment containing sacB were
each extracted and purified using GENECLEAN Kit (manufactured by BIO 101). The pCE53 fragment and the 2.6 kb
DNA fragment were ligated using Ligation Kit ver. 2 (manufactured by Takara Shuzo), introduced into the ATCC 13032
strain by the electroporation method (FEMS Microbiology Letters, 65: 299 (1989)), and cultured on BYG agar medium
(medium prepared by adding 10 g of glucose, 20 g of peptone (manufactured by Kyokuto Pharmaceutical), 5 g of yeast
extract (manufactured by Difco), and 16 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjusting its
pH to 7.2) containing 25 µg/ml kanamycin at 30°C for 2 days to obtain a transformant acquiring kanamycin resistance.
As a result of digestion analysis with restriction enzymes, it was confirmed that a plasmid extracted from the resulting
transformant by the alkali SDS method had a structure in which the 2.6 kb DNA fragment had been inserted into the
PstI site of pCE53. This plasmid was named pCES30.
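
The BYG agar recipe given in this paragraph is stated per liter of water. As a small bench-side aid, the sketch below scales it to another batch volume. The function and variable names are hypothetical, and the calculation assumes, as a simplification, that component amounts scale linearly with the volume of water.

```python
# BYG agar medium as stated in the text, in grams per liter of water.
BYG_AGAR_G_PER_LITER = {
    "glucose": 10.0,
    "peptone": 20.0,
    "yeast extract": 5.0,
    "Bactoagar": 16.0,
}
KANAMYCIN_UG_PER_ML = 25.0  # selection concentration stated in the text


def scale_byg_agar(volume_ml):
    """Return component masses (g) and kanamycin (mg) for a batch of the given volume."""
    factor = volume_ml / 1000.0
    amounts = {f"{name} (g)": grams * factor for name, grams in BYG_AGAR_G_PER_LITER.items()}
    # micrograms per ml times ml gives micrograms; divide by 1000 for milligrams
    amounts["kanamycin (mg)"] = KANAMYCIN_UG_PER_ML * volume_ml / 1000.0
    return amounts


for component, amount in scale_byg_agar(500).items():
    print(f"{component}: {amount:g}")
```

The pH adjustment to 7.2 is not modeled; it remains a manual step.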

[0377] Next, two genes having a mutation point, hom and pyc, were amplified by PCR, and inserted into pCES30
according to the TA cloning method (Bio Experiment Illustrated vol. 3, published by Shujunsha). Specifically, pCES30
was digested with BamHI (manufactured by Takara Shuzo), subjected to agarose gel electrophoresis, and extracted
and purified using GENECLEAN Kit (manufactured by BIO 101). Both ends of the resulting pCES30 fragment were
blunted with DNA Blunting Kit (manufactured by Takara Shuzo) according to the attached protocol. The blunt-ended
pCES30 fragment was concentrated by extraction with phenol/chloroform and precipitation with ethanol, and allowed
to react in the presence of Taq polymerase (manufactured by Roche Diagnostics) and dTTP at 70°C for 2 hours so
that a nucleotide, thymine (T), was added to the 3'-end to prepare a T vector of pCES30.

[0378] Separately, chromosomal DNA was prepared from the lysine-producing B-6 strain according to the method
of Saito et al. (Biochim. Biophys. Acta, 72: 619 (1963)). Using the chromosomal DNA as a template, PCR was carried
out with Pfu turbo DNA polymerase (manufactured by Stratagene). For the mutated hom gene, the DNAs having the
nucleotide sequences represented by SEQ ID NOS:7002 and 7003 were used as the primer set. For the mutated pyc
gene, the DNAs having the nucleotide sequences represented by SEQ ID NOS:7004 and 7005 were used as the primer
set. The resulting PCR product was subjected to agarose gel electrophoresis, and extracted and purified using
GENECLEAN Kit (manufactured by BIO 101). Then, the PCR product was allowed to react in the presence of Taq
polymerase (manufactured by Roche Diagnostics) and dATP at 72°C for 10 minutes so that a nucleotide, adenine (A),
was added to the 3'-end.

[0379] The above pCES30 T vector fragment and the PCR product of the mutated hom gene (1.7 kb) or mutated pyc
gene (3.6 kb) to which the nucleotide A had been added were concentrated by extraction with phenol/chloroform
and precipitation with ethanol, and then ligated using Ligation Kit ver. 2. The ligation products were introduced into the
ATCC 13032 strain according to the electroporation method, and cultured on BYG agar medium containing 25 µg/ml
kanamycin at 30°C for 2 days to obtain kanamycin-resistant transformants. Each of the resulting transformants was
cultured overnight in BYG liquid medium containing 25 µg/ml kanamycin, and a plasmid was extracted from the culture
according to the alkali SDS method. As a result of digestion analysis using restriction enzymes, it was
confirmed that the plasmid had a structure in which the 1.7 kb or 3.6 kb DNA fragment had been inserted into pCES30.
The plasmids thus constructed were named pChom59 and pCpyc458, respectively.

[0380] The introduction of the mutations into the wild type ATCC 13032 strain and the lysine-producing No. 58 strain
by the gene replacement method was carried out as follows. Specifically, pChom59
and pCpyc458 were introduced into the ATCC 13032 strain and the No. 58 strain, respectively, and strains in which the
plasmid was integrated into the chromosomal DNA by homologous recombination were selected using the method of
Ikeda et al. (Microbiology, 144: 1863 (1998)). Then, the strains in which the second homologous recombination had
occurred were selected by a selection method making use of the fact that the Bacillus subtilis levansucrase encoded
by pCES30 produces a suicidal substance (J. Bacteriol., 174: 5462 (1992)). Among the selected strains, strains in
which the wild type hom and pyc genes possessed by the ATCC 13032 strain and the No. 58 strain were replaced with
the mutated hom and pyc genes, respectively, were isolated. The method is specifically explained below.
[0381] One strain was selected from the transformants containing the plasmid, pChom59 or pCpyc458, and the
selected strain was cultured in BYG medium containing 20 µg/ml kanamycin, and pCG11 (Japanese Published
Examined Patent Application No. 91827/94) was introduced thereinto by the electroporation method. pCG11 is a plasmid
vector having a spectinomycin-resistant gene and a replication origin which is the same as that of pCE53. After
introduction of pCG11, the strain was cultured on BYG agar medium containing 20 µg/ml kanamycin and 100 µg/ml
spectinomycin at 30°C for 2 days to obtain a transformant resistant to both kanamycin and spectinomycin. The
chromosome of one strain of these transformants was examined by Southern blotting hybridization according to the
method reported by Ikeda et al. (Microbiology, 144: 1863 (1998)). As a result, it was confirmed that pChom59 or
pCpyc458 had been integrated into the chromosome by homologous recombination of the Campbell type. In such a
strain, the wild type and mutated hom or pyc genes are present close to each other on the chromosome, and the second
homologous recombination is liable to arise therebetween.

[0382] Each of these transformants (having been recombined once) was spread on Suc agar medium (medium
prepared by adding 100 g of sucrose, 7 g of meat extract, 10 g of peptone, 3 g of sodium chloride, 5 g of yeast extract
(manufactured by Difco), and 18 g of Bactoagar (manufactured by Difco) to 1 liter of water, and adjusting its pH to 7.2)
and cultured at 30°C for a day. Then the colonies thus growing were selected in each case. Since a strain in which the
sacB gene is present converts sucrose into a suicide substrate, it cannot grow in this medium (J. Bacteriol., 174: 5462
(1992)). On the other hand, a strain in which the sacB gene was deleted due to the second homologous recombination
between the wild type and the mutated hom or pyc genes positioned close to each other forms no suicide substrate
and, therefore, can grow in this medium. In the homologous recombination, either the wild type gene or the mutated
gene is deleted together with the sacB gene. When the wild type gene is deleted together with the sacB gene, the gene
replacement into the mutated type arises.
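
The logic of the two-step gene replacement described above can be summarized in a toy model. The sketch below is purely illustrative: the class and function names are hypothetical, and it reduces the biology to three flags (which allele copies are on the chromosome, whether sacB is present, and whether the kanamycin marker is present).

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Strain:
    alleles: tuple          # copies of the target gene present on the chromosome
    has_sacB: bool          # levansucrase gene carried by the integrated plasmid
    kan_resistant: bool     # kanamycin marker carried by the integrated plasmid


def integrate(host_allele, plasmid_allele):
    """First (Campbell-type) recombination: the whole plasmid enters the chromosome."""
    return Strain((host_allele, plasmid_allele), has_sacB=True, kan_resistant=True)


def resolve(single_recombinant):
    """Second recombination: the plasmid backbone leaves with one of the two copies."""
    return [
        Strain((kept,), has_sacB=False, kan_resistant=False)
        for kept in single_recombinant.alleles
    ]


def grows_on_sucrose(strain):
    """sacB converts sucrose into a suicide substrate, so only sacB-free strains grow."""
    return not strain.has_sacB


first = integrate("wild type hom", "hom Val59Ala")
print(grows_on_sucrose(first))            # the single recombinant is counter-selected
for second in resolve(first):
    print(second.alleles[0], grows_on_sucrose(second))
```

Both outcomes of the second recombination grow on sucrose, which is why the text goes on to sequence the PCR product of each second recombinant to tell wild type from mutant.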

[0383] Chromosomal DNA of each of the thus obtained second recombinants was prepared by the above method of
Saito et al. PCR was carried out using Pfu turbo DNA polymerase (manufactured by Stratagene) and the attached
buffer. For the hom gene, DNAs having the nucleotide sequences represented by SEQ ID NOS:7002 and 7003 were
used as the primer set. Also, for the pyc gene, DNAs having the nucleotide sequences represented by SEQ
ID NOS:7004 and 7005 were used as the primer set. The nucleotide sequences of the PCR products were determined
by the conventional method so that it was judged whether the hom or pyc gene of the second recombinant was a wild
type or a mutant. As a result, the second recombinants which were called HD-1 and No. 58pyc were target strains having
the mutated hom gene and pyc gene, respectively.
(3) Lysine production test of HD-1 and No. 58pyc strains

[0384] The HD-1 strain (strain obtained by incorporating the mutation, Val59Ala, in the hom gene into the ATCC
13032 strain) and the No. 58pyc strain (strain obtained by incorporating the mutation, Pro458Ser, in the pyc gene into
the lysine-producing No. 58 strain) were subjected to a culture test in a 5 l jar fermenter, using the ATCC 13032
strain and the lysine-producing No. 58 strain, respectively, as controls. Thus lysine production was examined.
[0385] After culturing on BYG agar medium at 30°C for 24 hours, each strain was inoculated into 250 ml of a seed
medium (medium prepared by adding 50 g of sucrose, 40 g of corn steep liquor, 8.3 g of ammonium sulfate, 1 g of
urea, 2 g of potassium dihydrogenphosphate, 0.83 g of magnesium sulfate heptahydrate, 10 mg of iron sulfate
heptahydrate, 1 mg of copper sulfate pentahydrate, 10 mg of zinc sulfate heptahydrate, 10 mg of β-alanine, 5 mg of
nicotinic acid, 1.5 mg of thiamin hydrochloride, and 0.5 mg of biotin to 1 liter of water, and adjusting its pH to 7.2, to
which 30 g of calcium carbonate had been added) contained in a 2 l baffle-attached Erlenmeyer flask and cultured therein
at 30°C for 12 to 16 hours. The total amount of the seed culture was inoculated into 1,400 ml of a main culture
medium (medium prepared by adding 60 g of glucose, 20 g of corn steep liquor, 25 g of ammonium [illegible], 2.5 g of
potassium dihydrogenphosphate, 0.75 g of magnesium sulfate heptahydrate, 50 mg of iron sulfate heptahydrate, 13
mg of manganese sulfate pentahydrate, 50 mg of calcium chloride, 6.3 mg of copper sulfate pentahydrate, 1.3 mg of
zinc sulfate heptahydrate, 5 mg of nickel chloride hexahydrate, 1.3 mg of cobalt chloride hexahydrate, 1.3 mg of
ammonium molybdate tetrahydrate, 14 mg of nicotinic acid, 23 mg of β-alanine, 7 mg of thiamin hydrochloride and
0.42 mg of biotin to 1 liter of water) contained in a 5 l jar fermenter and cultured therein at 32°C, 1 vvm and 800 rpm
while controlling the pH to 7.0 with aqueous ammonia. When glucose in the medium had been consumed, a glucose
feeding solution (medium prepared by adding 400 g of glucose and 45 g of ammonium chloride to 1 liter of water) was
continuously added. The feeding solution was added at a controlled speed so as to maintain the
dissolved oxygen concentration within a range of 0.5 to 3 ppm. After culturing for 29 hours, the culture was terminated.
The cells were separated from the culture medium by centrifugation and then L-lysine hydrochloride in the supernatant
was quantified by high performance liquid chromatography (HPLC). The results are shown in Table 2 below.



Table 2

Strain          L-Lysine hydrochloride yield (g/l)
ATCC 13032      0
HD-1            8
No. 58          45
No. 58pyc       51



[0386] As is apparent from the results shown in Table 2, the lysine productivity was improved by introducing the
mutation, Val59Ala, in the hom gene or the mutation, Pro458Ser, in the pyc gene. Accordingly, it was found that the
mutations are both effective mutations relating to the production of lysine. The AHP-3 strain, in which the mutation,
Val59Ala, in the hom gene and the mutation, Pro458Ser, in the pyc gene have been introduced into the wild type ATCC
13032 strain together with the mutation, Thr311Ile, in the lysC gene, was deposited on December 5, 2000, in the
National Institute of Bioscience and Human Technology, Agency of Industrial Science and Technology (Higashi 1-1-3,
Tsukuba-shi, Ibaraki, Japan) as FERM BP-7382.
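
The improvement reported in Table 2 can be restated as a simple parent-versus-mutant comparison. The sketch below uses only the four yields given in Table 2; the function name is hypothetical, and the percentage is left undefined where the parent yield is zero.

```python
# L-lysine hydrochloride yields (g/l) as reported in Table 2.
YIELD_G_PER_L = {"ATCC 13032": 0, "HD-1": 8, "No. 58": 45, "No. 58pyc": 51}

# Each mutant is compared with the parent strain used as its control.
PARENT_TO_MUTANT = [("ATCC 13032", "HD-1"), ("No. 58", "No. 58pyc")]


def yield_gain(parent, mutant):
    """Return (absolute gain in g/l, relative gain in % or None if the parent yield is 0)."""
    base, improved = YIELD_G_PER_L[parent], YIELD_G_PER_L[mutant]
    delta = improved - base
    percent = None if base == 0 else 100.0 * delta / base
    return delta, percent


for parent, mutant in PARENT_TO_MUTANT:
    delta, percent = yield_gain(parent, mutant)
    relative = "n/a" if percent is None else f"{percent:.1f}%"
    print(f"{parent} -> {mutant}: +{delta} g/l ({relative})")
```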
Example 3

Reconstruction of lysine-producing strain based on genome information

[0387] The lysine-producing mutant B-6 strain (Appl. Microbiol. Biotechnol., 32: 269-273 (1989)), which was
constructed by multiple rounds of random mutagenesis with NTG and screening from the wild type ATCC 13032 strain,
produces a remarkably large amount of lysine hydrochloride when cultured in a jar at 32°C using glucose as a carbon
source. However, since the fermentation period is long, the production rate is less than 2.1 g/l/h. Therefore, breeding
was performed to reconstitute, in the wild type ATCC 13032 strain, only the effective mutations relating to the
production of lysine from among the at least 300 mutations estimated to have been introduced into the B-6 strain.

(1) Identification of mutation point and effective mutation by comparing the gene nucleotide sequence of the B-6 strain
with that of the ATCC 13032 strain

[0388] As described above, the nucleotide sequences of genes derived from the B-6 strain were compared with the
corresponding nucleotide sequences of the ATCC 13032 strain genome represented by SEQ ID NOS:1 to 3501 and
analyzed to identify many mutation points accumulated in the chromosome of the B-6 strain. Among these, a mutation,
Val59Ala, in hom, a mutation, Thr311Ile, in lysC, a mutation, Pro458Ser, in pyc and a mutation, Ala213Thr, in zwf
were specified as effective mutations relating to the production of lysine. Breeding to reconstitute the 4 mutations in
the wild type strain and to construct an industrially important lysine-producing strain was carried out according
to the method shown below.

(2) Construction of plasmid for gene replacement having mutated gene 

[0389] The plasmid for gene replacement, pChom59, having the mutated hom gene and the plasmid for gene
replacement, pCpyc458, having the mutated pyc gene were prepared in the above Example 2(2). Plasmids for gene
replacement having the mutated lysC and zwf were produced as described below.

[0390] The lysC and zwf genes having mutation points were amplified by PCR, and inserted into a plasmid for gene
replacement, pCES30, according to the TA cloning method described in Example 2(2) (Bio Experiment Illustrated, Vol. 3).
[0391] Separately, chromosomal DNA was prepared from the lysine-producing B-6 strain according to the above
method of Saito et al. Using the chromosomal DNA as a template, PCR was carried out with Pfu turbo DNA polymerase
(manufactured by Stratagene). For the mutated lysC gene, the DNAs having the nucleotide sequences represented by
SEQ ID NOS:7006 and 7007 were used as the primer set. For the mutated zwf gene, the DNAs having the nucleotide
sequences represented by SEQ ID NOS:7008 and 7009 were used as the primer set. The resulting PCR product was
subjected to agarose gel electrophoresis, and extracted and purified using GENECLEAN Kit (manufactured by BIO 101).
Then, the PCR product was allowed to react in the presence of Taq DNA polymerase (manufactured by Roche Diagnostics)
and dATP at 72°C for 10 minutes so that a nucleotide, adenine (A), was added to the 3'-end.
[0392] The above pCES30 T vector fragment and the PCR product of the mutated lysC gene (1.5 kb) or mutated zwf
gene (2.3 kb) to which the nucleotide A had been added were concentrated by extraction with phenol/chloroform
and precipitation with ethanol, and then ligated using Ligation Kit ver. 2. The ligation products were introduced into the
ATCC 13032 strain according to the electroporation method, and cultured on BYG agar medium containing 25 µg/ml
kanamycin at 30°C for 2 days to obtain kanamycin-resistant transformants. Each of the resulting transformants was
cultured overnight in BYG liquid medium containing 25 µg/ml kanamycin, and a plasmid was extracted from the culture
according to the alkali SDS method. As a result of digestion analysis using restriction enzymes, it was
confirmed that the plasmid had a structure in which the 1.5 kb or 2.3 kb DNA fragment had been inserted into pCES30.
The plasmids thus constructed were named pClysC311 and pCzwf213, respectively.

(3) Introduction of mutation, Thr311Ile, in lysC into one point mutant HD-1

[0393] Since the one point mutant HD-1, in which the mutation, Val59Ala, in hom was introduced into the
wild type ATCC 13032 strain, had been obtained in Example 2(2), the mutation, Thr311Ile, in lysC was introduced into
the HD-1 strain using pClysC311 produced in the above (2) according to the gene replacement method described in
Example 2(2). PCR was carried out using chromosomal DNA of the resulting strain and, as the primer set, DNAs having
the nucleotide sequences represented by SEQ ID NOS:7006 and 7007 in the same manner as in Example 2(2). The
nucleotide sequence of the PCR product was determined in the usual manner, and it was confirmed
that the strain, which was named AHD-2, was a two point mutant having the mutated lysC gene in addition to the mutated
hom gene.
(4) Introduction of mutation, Pro458Ser, in pyc into two point mutant AHD-2

[0394] The mutation, Pro458Ser, in pyc was introduced into the AHD-2 strain using the pCpyc458 produced in
Example 2(2) by the gene replacement method described in Example 2(2). PCR was carried out using chromosomal
DNA of the resulting strain and, as the primer set, DNAs having the nucleotide sequences represented by SEQ ID
NOS:7004 and 7005 in the same manner as in Example 2(2). The nucleotide sequence of
the PCR product was determined in the usual manner, and it was confirmed that the strain, which was named AHP-3, was
a three point mutant having the mutated pyc gene in addition to the mutated hom gene and lysC gene.

(5) Introduction of mutation, Ala213Thr, in zwf into three point mutant AHP-3

[0395] The mutation, Ala213Thr, in zwf was introduced into the AHP-3 strain using the pCzwf213 produced in the
above (2) by the gene replacement method described in Example 2(2). PCR was carried out using chromosomal DNA
of the resulting strain and, as the primer set, DNAs having the nucleotide sequences represented by SEQ ID NOS:
7008 and 7009 in the same manner as in Example 2(2). The nucleotide sequence of the PCR
product was determined in the usual manner, and it was confirmed that the strain, which was named APZ-4, was a four
point mutant having the mutated zwf gene in addition to the mutated hom gene, lysC gene and pyc gene.

(6) Lysine production test on HD-1, AHD-2, AHP-3 and APZ-4 strains

[0396] The HD-1, AHD-2, AHP-3 and APZ-4 strains obtained above were subjected to a culture test in a 5 l jar
fermenter in accordance with the method of Example 2(3).
[0397] Table 3 shows the results. 

Table 3

Strain    L-Lysine hydrochloride (g/l)    Productivity (g/l/h)
HD-1      8                               0.3
AHD-2     73                              2.5
AHP-3     80                              2.8
APZ-4     86                              3.0



[0398] Since the lysine-producing mutant B-6 strain, which was bred by random mutation and selection,
shows a productivity of less than 2.1 g/l/h, the APZ-4 strain showing a high productivity of 3.0 g/l/h is useful in industry.
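
The productivity column of Table 3 appears consistent with dividing each titer by the culture time. The sketch below checks this, assuming that all four strains were cultured for the 29 hours stated in Example 2(3); that shared duration is an inference from the reference to "the method of Example 2(3)" and is not stated separately for each strain.

```python
CULTURE_HOURS = 29  # culture time given in Example 2(3); assumed to apply to every strain

# L-lysine hydrochloride titers (g/l) as reported in Table 3.
TITER_G_PER_L = {"HD-1": 8, "AHD-2": 73, "AHP-3": 80, "APZ-4": 86}


def productivity(titer_g_per_l, hours=CULTURE_HOURS):
    """Volumetric productivity in g/l/h: final titer divided by culture time."""
    return titer_g_per_l / hours


for strain, titer in TITER_G_PER_L.items():
    print(f"{strain}: {productivity(titer):.1f} g/l/h")
```

Rounded to one decimal place, these values reproduce the 0.3, 2.5, 2.8 and 3.0 g/l/h listed in Table 3.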

(7) Lysine fermentation by APZ-4 strain at high temperature

[0399] The APZ-4 strain, which had been reconstructed by introducing 4 effective mutations into the wild type strain,
was subjected to the culturing test in a 5 l jar fermenter in the same manner as in Example 2(3), except that the culturing
temperature was changed to 40°C.
[0400] The results are shown in Table 4.



Table 4

Temperature (°C)    L-Lysine hydrochloride (g/l)    Productivity (g/l/h)
32                  86                              3.0
40                  95                              3.3



[0401] As is apparent from the results shown in Table 4, the lysine hydrochloride titer and productivity obtained by
culturing at a high temperature of 40°C were comparable to those at 32°C. In the lysine-producing B-6 strain
bred by repeating random mutation and selection, the growth and the lysine productivity are lowered
at temperatures exceeding 34°C so that lysine fermentation cannot be carried out, whereas lysine fermentation can
be carried out using the APZ-4 strain at a high temperature of 40°C, so that the load of cooling is greatly reduced,
which is industrially useful. Lysine fermentation at high temperatures can be achieved because the high temperature
adaptability inherently possessed by the wild type strain is retained in the APZ-4 strain.

[0402] As demonstrated in the reconstruction of the lysine-producing strain, the present invention provides a novel
breeding method effective for eliminating the problems of conventional mutants and acquiring industrially
advantageous strains. This methodology, which reconstitutes the production strain by reconstituting the effective
mutations, is an approach that is efficiently carried out using the nucleotide sequence information of the genome
disclosed in the present invention, and its effectiveness was found for the first time in the present invention.

Example 4 

Production of DNA microarray and use thereof

[0403] A DNA microarray was produced based on the nucleotide sequence information of the ORFs deduced from
the full nucleotide sequence of Corynebacterium glutamicum ATCC 13032 using software, and genes whose
expression fluctuates depending on the carbon source during culturing were searched for.

(1) Production of DNA microarray

[0404] Chromosomal DNA was prepared from Corynebacterium glutamicum ATCC 13032 by the method of Saito et
al. (Biochim. Biophys. Acta, 72: 619 (1963)). Based on 24 genes having the nucleotide sequences represented by
SEQ ID NOS:207, 3433, 281, 3435, 3439, 765, 3445, 1226, 1229, 3448, 3451, 3453, 3455, 1743, 3470, 2132, 3476,
3477, 3485, 3488, 3489, 3494, 3496, and 3497 from the ORFs shown in Table 1 deduced from the full genome
nucleotide sequence of Corynebacterium glutamicum ATCC 13032 using software and the nucleotide sequence of rabbit
globin gene (GenBank Accession No. V00882) used as an internal standard, oligo DNA primers for PCR amplification
represented by SEQ ID NOS:7010 to 7059 targeting the nucleotide sequences of the genes were synthesized in a
usual manner.
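
The primer assignments enumerated in the paragraphs that follow appear to be regular: consecutive primer pairs starting at SEQ ID NO:7010 are assigned to the 24 target genes in the order listed. A minimal sketch of that bookkeeping is given below. The names are hypothetical, and the assignment of the final pair (SEQ ID NOS:7058 and 7059) to the rabbit globin internal standard is an assumption based on the count of 25 pairs for 24 genes plus one standard.

```python
# Target ORFs (SEQ ID NOs) in the order in which their primer pairs are enumerated.
TARGET_SEQ_IDS = [
    207, 3433, 281, 3435, 3439, 765, 3445, 1226, 1229, 3448, 3451, 3453,
    3455, 1743, 3470, 2132, 3476, 3477, 3485, 3488, 3489, 3494, 3496, 3497,
]
FIRST_PRIMER_SEQ_ID = 7010
LAST_PRIMER_SEQ_ID = 7059


def primer_pairs(targets=TARGET_SEQ_IDS, first=FIRST_PRIMER_SEQ_ID):
    """Map each target SEQ ID NO to its (forward, reverse) primer SEQ ID NOs."""
    return {
        target: (first + 2 * index, first + 2 * index + 1)
        for index, target in enumerate(targets)
    }


pairs = primer_pairs()
for target, (forward, reverse) in pairs.items():
    print(f"SEQ ID NO:{target} <- primers SEQ ID NOS:{forward} and {reverse}")

# The one pair left over is presumed to target the rabbit globin internal standard.
leftover = (FIRST_PRIMER_SEQ_ID + 2 * len(TARGET_SEQ_IDS), LAST_PRIMER_SEQ_ID)
print("remaining pair:", leftover)
```

A lookup table of this kind makes it easy to spot a mismatched primer number in a long enumeration.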

[0405] As the oligo DNA primers used for the PCR,

[0406] DNAs having the nucleotide sequence represented by SEQ ID NOS:7010 and 7011 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:207,

[0407] DNAs having the nucleotide sequence represented by SEQ ID NOS:7012 and 7013 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3433, 

[0408] DNAs having the nucleotide sequence represented by SEQ ID NOS:7014 and 7015 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:281,

[0409] DNAs having the nucleotide sequence represented by SEQ ID NOS:7016 and 7017 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:3435,

[0410] DNAs having the nucleotide sequence represented by SEQ ID NOS:7018 and 7019 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:3439,

[0411] DNAs having the nucleotide sequence represented by SEQ ID NOS:7020 and 7021 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:765, 

[0412] DNAs having the nucleotide sequence represented by SEQ ID NOS:7022 and 7023 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:3445,

[0413] DNAs having the nucleotide sequence represented by SEQ ID NOS:7024 and 7025 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:1226, 

[0414] DNAs having the nucleotide sequence represented by SEQ ID NOS:7026 and 7027 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:1229, 

[0415] DNAs having the nucleotide sequence represented by SEQ ID NOS:7028 and 7029 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3448, 

[0416] DNAs having the nucleotide sequence represented by SEQ ID NOS:7030 and 7031 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3451, 

[0417] DNAs having the nucleotide sequence represented by SEQ ID NOS:7032 and 7033 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:3453,

[0418] DNAs having the nucleotide sequence represented by SEQ ID NOS:7034 and 7035 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:3455,

[0419] DNAs having the nucleotide sequence represented by SEQ ID NOS:7036 and 7037 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:1743,

[0420] DNAs having the nucleotide sequence represented by SEQ ID NOS:7038 and 7039 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3470, 

[0421] DNAs having the nucleotide sequence represented by SEQ ID NOS:7040 and 7041 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:2132,

[0422] DNAs having the nucleotide sequence represented by SEQ ID NOS:7042 and 7043 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:3476,

[0423] DNAs having the nucleotide sequence represented by SEQ ID NOS:7044 and 7045 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:3477,

[0424] DNAs having the nucleotide sequence represented by SEQ ID NOS:7046 and 7047 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3485, 

[0425] DNAs having the nucleotide sequence represented by SEQ ID NOS:7048 and 7049 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3488, 

[0426] DNAs having the nucleotide sequence represented by SEQ ID NOS:7050 and 7051 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3489, 

[0427] DNAs having the nucleotide sequence represented by SEQ ID NOS:7052 and 7053 were used for the
amplification of the DNA having the nucleotide sequence represented by SEQ ID NO:3494,

[0428] DNAs having the nucleotide sequence represented by SEQ ID NOS:7054 and 7055 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3496, 

[0429] DNAs having the nucleotide sequence represented by SEQ ID NOS:7056 and 7057 were used for the am- 
plification of the DNA having the nucleotide sequence represented by SEQ ID NO:3497, and 

[0430] DNAs having the nucleotide sequence represented by SEQ ID NOS:7058 and 7059 were used for the am- 
plification of the DNA having the nucleotide sequence of the rabbit globin gene. 






EP1 108 790 A2 



as the respective primer set. 

[0431] The PCR was carried out for 30 cycles, each cycle consisting of 15 seconds at 95°C and 3 minutes at 68°C,
using a thermal cycler (GeneAmp PCR System 9600, manufactured by Perkin Elmer), TaKaRa Ex-Taq (manufactured
by Takara Shuzo), 100 ng of the chromosomal DNA and the buffer attached to the TaKaRa Ex-Taq reagent. In the case
of the rabbit globin gene, a single-stranded cDNA which had been synthesized from rabbit globin mRNA (manufactured
by Life Technologies) according to the manufacturer's instructions using a reverse transcriptase RAV-2 (manufactured
by Takara Shuzo) was used as the template. The PCR product of each gene thus amplified was subjected to agarose
gel electrophoresis and extracted and purified using QIAquick Gel Extraction Kit (manufactured by QIAGEN). The
purified PCR product was concentrated by precipitating it with ethanol and adjusted to a concentration of 200 ng/µl.
Each PCR product was spotted on a slide glass plate (manufactured by Matsunami Glass) having MAS coating in
2 runs using GTMASS SYSTEM (manufactured by Nippon Laser & Electronics Lab.) according to the manufacturer's
instructions.

(2) Synthesis of fluorescence labeled cDNA 

[0432] The ATCC 13032 strain was spread on BY agar medium (medium prepared by adding 20 g of peptone
(manufactured by Kyokuto Pharmaceutical), 5 g of yeast extract (manufactured by Difco), and 16 g of Bactoagar
(manufactured by Difco) to 1 liter of water and adjusting its pH to 7.2) and cultured at 30°C for 2 days. Then, the
cultured strain was inoculated into 5 ml of BY liquid medium and cultured at 30°C overnight. Then, the cultured strain
was further inoculated into 30 ml of a minimum medium (medium prepared by adding 6 g of ammonium sulfate, 5 g of
urea, 0.5 g of monopotassium dihydrogenphosphate, 0.5 g of dipotassium monohydrogenphosphate, 20.9 g of
morpholinopropanesulfonic acid, 0.25 g of magnesium sulfate heptahydrate, 10 mg of calcium chloride dihydrate, 10 mg
of manganese sulfate monohydrate, 10 mg of ferrous sulfate heptahydrate, 1 mg of zinc sulfate heptahydrate, 0.2 mg
of copper sulfate, and 0.2 mg of biotin to 1 liter of water, and adjusting its pH to 6.5) containing 110 mmol/l glucose or
200 mmol/l ammonium acetate, and cultured in an Erlenmeyer flask at 30°C until the absorbance at 660 nm reached
1.0. After the cells were collected by centrifugation at 4°C and 5,000 rpm for 10 minutes, total RNA was prepared from
the resulting cells according to the method of Bormann et al. (Molecular Microbiology, 6: 317-326 (1992)). To avoid
contamination with DNA, the RNA was treated with DNase I (manufactured by Takara Shuzo) at 37°C for 30 minutes
and then further purified using Qiagen RNeasy MiniKit (manufactured by QIAGEN) according to the manufacturer's
instructions. To 30 µg of the resulting total RNA, 0.6 µl of rabbit globin mRNA (50 ng/µl, manufactured by Life
Technologies) and 1 µl of a random 6 mer primer (500 ng/µl, manufactured by Takara Shuzo) were added, and the
mixture was denatured at 65°C for 10 minutes, followed by quenching on ice. To the resulting solution, 6 µl of the buffer
attached to SuperScript II (manufactured by Life Technologies), 3 µl of 0.1 mol/l DTT, 1.5 µl of dNTPs (25 mmol/l
dATP, 25 mmol/l dCTP, 25 mmol/l dGTP, 10 mmol/l dTTP), 1.5 µl of Cy5-dUTP or Cy3-dUTP (manufactured by NEN)
and 2 µl of SuperScript II were added, and the mixture was allowed to stand at 25°C for 10 minutes and then at 42°C
for 110 minutes. The RNA extracted from the cells using glucose as the carbon source and the RNA extracted from
the cells using ammonium acetate were labeled with Cy5-dUTP and Cy3-dUTP, respectively. After the fluorescence
labeling reaction, the RNA was digested by adding 1.5 µl of 1 mol/l sodium hydroxide-20 mmol/l EDTA solution and
3.0 µl of 10% SDS solution, and allowing the mixture to stand at 65°C for 10 minutes. The two cDNA solutions after
the labeling were mixed and purified using Qiagen PCR purification Kit (manufactured by QIAGEN) according to the
manufacturer's instructions to give a volume of 10 µl.

(3) Hybridization 

[0433] UltraHyb (110 µl) (manufactured by Ambion) and the fluorescence-labeled cDNA solution (10 µl) were mixed
and subjected to hybridization and the subsequent washing of the slide glass using GeneTAC Hybridization Station
(manufactured by Genomic Solutions) according to the manufacturer's instructions. The hybridization was carried out
at 50°C, and the washing was carried out at 25°C.

(4) Fluorescence analysis 

[0434] The fluorescence amount of each DNA array having the fluorescent cDNA hybridized therewith was measured
using ScanArray 4000 (manufactured by GSI Lumonics).

[0435] Table 5 shows the Cy3 and Cy5 signal intensities of the genes, corrected on the basis of the data of the
rabbit globin used as the internal standard, and the Cy3/Cy5 ratios.
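
The correction against the rabbit globin internal standard can be illustrated with a short Python sketch. It assumes that the correction is a single scaling factor applied to the Cy5 channel so that the globin spot, whose mRNA was added in equal amounts to both labeling reactions, gives a Cy3/Cy5 ratio of 1. The raw intensities, the spot names and the function name are hypothetical; the text does not give the raw values or the exact correction formula.

```python
def normalize_to_spike(raw, spike="rabbit_globin"):
    """Scale the Cy5 channel so the spike-in control has Cy3/Cy5 == 1.

    This single-factor scheme is an assumption made for illustration.
    """
    cy3_spike, cy5_spike = raw[spike]
    factor = cy3_spike / cy5_spike
    return {name: (cy3, cy5 * factor) for name, (cy3, cy5) in raw.items()}


# Hypothetical raw intensities as (Cy3, Cy5); not taken from the source.
raw = {"rabbit_globin": (2000.0, 2500.0), "gene_x": (3000.0, 1250.0)}

corrected = normalize_to_spike(raw)
for name, (cy3, cy5) in corrected.items():
    print(name, round(cy3), round(cy5), round(cy3 / cy5, 2))
```

After the scaling, every Cy3/Cy5 ratio is expressed relative to the globin spot, so a ratio well above 1 indicates a higher signal in the Cy3-labeled (ammonium acetate) sample.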



Table 5

SEQ ID NO    Cy3 intensity    Cy5 intensity    Cy3/Cy5
207          5248             3240             1.62
3433         2239             2694             0.83
281          2370             2595             0.91
3435         2566             2515             1.02
3439         5597             6944             0.81
765          6134             4943             1.24
3445         1169             1284             0.91
1226         1301             1493             0.87
1229         1168             1131             1.03
3448         1187             1594             0.74
3451         2845             3859             0.74
3453         3498             1705             2.05
3455         1491             1144             1.30
1743         1972             1841             1.07
3470         4752             3764             1.26
2132         1173             1085             1.08
3476         1847             1420             1.30
3477         1284             1164             1.10
3485         4539             8014             0.57
3488         34289            1398             24.52
3489         43645            1497             29.16
3494         3199             2503             1.28
3496         3428             2364             1.45
3497         3848             3358             1.15



[0436] The ORF function data estimated by using software were searched for SEQ ID NOS:3488 and 3489, which showed
remarkably strong Cy3 signals. As a result, it was found that SEQ ID NOS:3488 and 3489 are a malate synthase
gene and an isocitrate lyase gene, respectively. It is known that these genes are transcriptionally induced by acetic
acid in Corynebacterium glutamicum (Archives of Microbiology, 168: 262-269 (1997)).
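
The selection of the strongly induced genes from Table 5 can be sketched in Python as follows. The intensity values are transcribed from Table 5; the cutoff of 10 for the Cy3/Cy5 ratio and the function name are assumptions chosen only for illustration, since the text speaks of remarkably strong Cy3 signals without stating a numerical criterion.

```python
# (SEQ ID NO, Cy3 intensity, Cy5 intensity), transcribed from Table 5.
TABLE_5 = [
    (207, 5248, 3240), (3433, 2239, 2694), (281, 2370, 2595),
    (3435, 2566, 2515), (3439, 5597, 6944), (765, 6134, 4943),
    (3445, 1169, 1284), (1226, 1301, 1493), (1229, 1168, 1131),
    (3448, 1187, 1594), (3451, 2845, 3859), (3453, 3498, 1705),
    (3455, 1491, 1144), (1743, 1972, 1841), (3470, 4752, 3764),
    (2132, 1173, 1085), (3476, 1847, 1420), (3477, 1284, 1164),
    (3485, 4539, 8014), (3488, 34289, 1398), (3489, 43645, 1497),
    (3494, 3199, 2503), (3496, 3428, 2364), (3497, 3848, 3358),
]


def induced(rows, threshold=10.0):
    """Return the SEQ ID NOs whose Cy3/Cy5 ratio exceeds the threshold.

    The threshold is a hypothetical cutoff, not a value from the source.
    """
    return [seq_id for seq_id, cy3, cy5 in rows if cy3 / cy5 > threshold]


print(induced(TABLE_5))
```

With this cutoff only SEQ ID NOS:3488 and 3489 are selected; the next highest ratio in the table is about 2.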

[0437] As described above, a gene whose expression fluctuates could be discovered by synthesizing appropriate
oligo DNA primers based on the ORF nucleotide sequence information deduced, using software, from the full genomic
nucleotide sequence information of Corynebacterium glutamicum ATCC 13032, amplifying the nucleotide sequences
of the genes by PCR using the genome DNA of Corynebacterium glutamicum as a template, and thus
producing and using a DNA microarray.

[0438] This Example shows that the expression amount can be analyzed using a DNA microarray for the 24 genes.
On the other hand, the present DNA microarray techniques make it possible to prepare DNA microarrays having thereon
several thousand gene probes at once. Accordingly, it is also possible to prepare DNA microarrays having thereon all
of the ORF gene probes deduced from the full genomic nucleotide sequence of Corynebacterium glutamicum ATCC
13032 determined by the present invention, and to analyze the expression profile at the total gene level of
Corynebacterium glutamicum using these arrays.

Example 5 

Homology search using Corynebacterium glutamicum genome sequence
(1) Search of adenosine deaminase 

[0439] The amino acid sequence (ADD_ECOLI) of Escherichia coli adenosine deaminase was obtained from Swiss-prot
Database as the amino acid sequence of a protein whose function had been confirmed as adenosine deaminase
(EC 3.5.4.4). By using the full length of this amino acid sequence as a query, a homology search was carried out on a
nucleotide sequence database of the genome sequence of Corynebacterium glutamicum or a database of the amino
acid sequences in the ORF region deduced from the genome sequence using FASTA program (Proc. Natl. Acad. Sci.
USA, 85: 2444-2448 (1988)). A case where E-value was 1e-10 or less was judged as being significantly homologous.
As a result, no sequence significantly homologous with the Escherichia coli adenosine deaminase was found in the
nucleotide sequence database of the genome sequence of Corynebacterium glutamicum or the database of the amino
acid sequences in the ORF region deduced from the genome sequence. Based on these results, it is assumed that
Corynebacterium glutamicum contains no ORF having adenosine deaminase activity and thus has no activity of
converting adenosine into inosine.

(2) Search of glycine cleavage enzyme 

[0440] The amino acid sequences (GCSP_ECOLI, GCST_ECOLI and GCSH_ECOLI) of glycine decarboxylase,
aminomethyl transferase and an aminomethyl group carrier, each of which is a component of Escherichia coli glycine
cleavage enzyme, were obtained from Swiss-prot Database as the amino acid sequences of proteins whose function
had been confirmed as glycine cleavage enzyme (EC 2.1.2.10).

[0441] By using these full-length amino acid sequences as queries, a homology search was carried out on a nucleotide
sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF amino acid
sequences deduced from the genome sequence using FASTA program. A case where E-value was 1e-10 or less was
judged as being significantly homologous. As a result, no sequence significantly homologous with the glycine
decarboxylase, the aminomethyl transferase or the aminomethyl group carrier, each of which is a component of
Escherichia coli glycine cleavage enzyme, was found in the nucleotide sequence database of the genome sequence of
Corynebacterium glutamicum or the database of the ORF amino acid sequences estimated from the genome sequence.
Based on these results, it is assumed that Corynebacterium glutamicum contains no ORF having the activity of glycine
decarboxylase, aminomethyl transferase or the aminomethyl group carrier and thus has no activity of the glycine
cleavage enzyme.

(3) Search of IMP dehydrogenase 

[0442] The amino acid sequence (IMDH_ECOLI) of Escherichia coli IMP dehydrogenase was obtained from Swiss-prot
Database as the amino acid sequence of a protein whose function had been confirmed as IMP dehydrogenase
(EC 1.1.1.205). By using the full length of this amino acid sequence as a query, a homology search was carried out on
a nucleotide sequence database of the genome sequence of Corynebacterium glutamicum or a database of the ORF
amino acid sequences predicted from the genome sequence using FASTA program. A case where E-value was 1e-10
or less was judged as being significantly homologous. As a result, the amino acid sequences encoded by two ORFs,
namely, an ORF positioned in the region of the nucleotide sequence No. 615336 to 616853 (or ORF having the
nucleotide sequence represented by SEQ ID NO:672) and another ORF positioned in the region of the nucleotide
sequence No. 616973 to 618094 (or ORF having the nucleotide sequence represented by SEQ ID NO:674), were
significantly homologous with Escherichia coli IMP dehydrogenase. In order to examine in greater detail the similarity
of the amino acid sequences encoded by the ORFs with IMP dehydrogenases of other organisms, a search was carried
out, using the above-described predicted amino acid sequences as queries, on GenBank (http://www.ncbi.nlm.nih.gov/)
nr-aa database (amino acid sequence database constructed on the basis of GenBank CDS translation products,
PDB database, Swiss-Prot database, PIR database and PRF database by eliminating duplicated registrations) using
BLAST program. As a result, both of the two amino acid sequences showed significant homologies with IMP
dehydrogenases of other organisms and clearly higher homologies with IMP dehydrogenases than with amino acid
sequences of other proteins, and thus, it was assumed that the two ORFs would function as IMP dehydrogenase. Based
on these results, it was therefore assumed that Corynebacterium glutamicum has two ORFs having the IMP
dehydrogenase activity.
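
The significance criterion used in the searches of this Example, and the coordinates of the two ORFs found in (3), can be checked with a minimal Python sketch. The hit names and E-values in `hits` are hypothetical and serve only to show the filtering step; the cutoff is written as 1e-10 and should be treated as an assumption. The ORF coordinates and SEQ ID NOs are those given in the text.

```python
E_VALUE_CUTOFF = 1e-10  # cutoff assumed for this sketch


def significant(hits, cutoff=E_VALUE_CUTOFF):
    """Keep the names of hits whose E-value is at or below the cutoff."""
    return [name for name, e_value in hits if e_value <= cutoff]


def orf_length(start, end):
    """Length in nucleotides of an ORF given inclusive genome coordinates."""
    return end - start + 1


# Hypothetical search output as (hit name, E-value); not real FASTA results.
hits = [("ORF_A", 3e-45), ("ORF_B", 2e-12), ("ORF_C", 0.004)]
print(significant(hits))

# ORF coordinates stated in the text, keyed by SEQ ID NO.
ORFS = {672: (615336, 616853), 674: (616973, 618094)}
for seq_id, (start, end) in ORFS.items():
    n = orf_length(start, end)
    print(seq_id, n, "nt,", n // 3, "triplets, remainder", n % 3)
```

Both stated regions have lengths that are exact multiples of three, which is consistent with their being read as ORFs.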

Example 6 

Proteome analysis of proteins derived from Corynebacterium glutamicum

(1) Preparations of proteins derived from Corynebacterium glutamicum ATCC 13032, FERM BP-7134 and FERM BP-158

[0443] Culturing tests of Corynebacterium glutamicum ATCC 13032 (wild type strain), Corynebacterium glutamicum
FERM BP-7134 (lysine-producing strain) and Corynebacterium glutamicum FERM BP-158 (lysine-highly producing
strain) were carried out in a 5 l jar fermenter according to the method in Example 2(3). The results are shown in Table 6.






Table 6

Strain          L-Lysine yield (g/l)
ATCC 13032      0
FERM BP-7134    45
FERM BP-158     60



[0444] After culturing, cells of each strain were recovered by centrifugation. These cells were washed three times
with Tris-HCl buffer (10 mmol/l Tris-HCl, pH 6.5, 1.6 mg/ml protease inhibitor (COMPLETE; manufactured by
Boehringer Mannheim)) to give washed cells which could be stored under freezing at -80°C. The freeze-stored cells
were thawed before use, and used as washed cells.

[0445] The washed cells described above were suspended in a disruption buffer (10 mmol/l Tris-HCl, pH 7.4, 5 mmol/l
magnesium chloride, 50 mg/l RNase, 1.6 mg/ml protease inhibitor (COMPLETE; manufactured by Boehringer
Mannheim)), and disrupted with a disrupter (manufactured by Brown) under cooling. To the resulting disruption solution,
DNase was added to give a concentration of 60 mg/l, and the mixture was allowed to stand on ice for 10 minutes. The
solution was centrifuged (5,000 x g, 15 minutes, 4°C) to remove the undisrupted cells as the precipitate, and the
supernatant was recovered.

[0446] To the supernatant, urea was added to give a concentration of 9 mol/l, and an equivalent amount of a lysis
buffer (9.5 mol/l urea, 2% NP-40, 2% Ampholine, 5% mercaptoethanol, 1.6 mg/ml protease inhibitor (COMPLETE;
manufactured by Boehringer Mannheim)) was added thereto, followed by thorough stirring at room temperature for
dissolving.

[0447] After being dissolved, the solution was centrifuged at 12,000 x g for 15 minutes, and the supernatant was
recovered.

[0448] To the supernatant, ammonium sulfate was added to the extent of 80% saturation, followed by thorough
stirring for dissolving.

[0449] After being dissolved, the solution was centrifuged (16,000 x g, 20 minutes, 4°C), and the precipitate was
recovered. This precipitate was dissolved in the lysis buffer again and used in the subsequent procedures as a protein
sample. The protein concentration of this sample was determined by the method for quantifying protein of Bradford.

(2) Separation of protein by two dimensional electrophoresis 

[0450] The first dimensional electrophoresis was carried out as described below by the isoelectric electrophoresis 
method. 

[0451] A molded dry IPG strip gel (pH 4-7, 13 cm, Immobiline DryStrips; manufactured by Amersham Pharmacia
Biotech) was set in an electrophoretic apparatus (Multiphor II or IPGphor; manufactured by Amersham Pharmacia
Biotech) and a swelling solution (8 mol/l urea, 0.5% Triton X-100, 0.6% dithiothreitol, 0.5% Ampholine, pH 3-10) was
packed therein, and the gel was allowed to stand for swelling for 12 to 16 hours.

[0452] The protein sample prepared above was dissolved in a sample solution (9 mol/l urea, 2% CHAPS, 1%
dithiothreitol, 2% Ampholine, pH 3-10), and then about 100 to 500 µg (in terms of protein) portions thereof were taken
and added to the swollen IPG strip gel.

[0453] The electrophoresis was carried out in the 4 steps defined below while controlling the temperature to 20°C:

step 1: 1 hour under a gradient mode of 0 to 500 V;
step 2: 1 hour under a gradient mode of 500 to 1,000 V;
step 3: 4 hours under a gradient mode of 1,000 to 8,000 V; and
step 4: 1 hour at a constant voltage of 8,000 V.
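
The four-step voltage program can be summarized as a total number of volt-hours, a figure commonly used to compare isoelectric focusing runs. The following Python sketch assumes that each gradient step ramps linearly between its start and end voltage, which the text does not state explicitly; the names `STEPS` and `volt_hours` are illustrative.

```python
# (start voltage in V, end voltage in V, duration in hours) for steps 1 to 4.
STEPS = [
    (0, 500, 1.0),
    (500, 1000, 1.0),
    (1000, 8000, 4.0),
    (8000, 8000, 1.0),
]


def volt_hours(steps):
    """Integrate voltage over time, assuming linear ramps within each step."""
    return sum((v_start + v_end) / 2 * hours for v_start, v_end, hours in steps)


print(volt_hours(STEPS))  # 250 + 750 + 18000 + 8000 volt-hours
```

Under the linear-ramp assumption the program delivers 27,000 Vh over 7 hours, most of it in steps 3 and 4.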



[0454] After the isoelectric electrophoresis, the IPG strip gel was removed from the holder and soaked in an
equilibration buffer A (50 mmol/l Tris-HCl, pH 6.8, 30% glycerol, 1% SDS, 0.25% dithiothreitol) for 15 minutes and
in another equilibration buffer B (50 mmol/l Tris-HCl, pH 6.8, 6 mol/l urea, 30% glycerol, 1% SDS, 0.45%
iodoacetamide) for 15 minutes to sufficiently equilibrate the gel.

[0455] After the equilibration, the IPG strip gel was lightly rinsed in an SDS electrophoresis buffer (1.4% glycine, 0.1%
SDS, 0.3% Tris-HCl, pH 8.5), and the second dimensional electrophoresis depending on molecular weight was carried
out as described below to separate the proteins.

[0456] Specifically, the above IPG strip gel was closely placed on a 14% polyacrylamide slab gel (14% polyacrylamide,
0.37% bisacrylamide, 37.5 mmol/l Tris-HCl, pH 8.8, 0.1% SDS, 0.1% TEMED, 0.1% ammonium persulfate) and
subjected to electrophoresis under a constant current of 30 mA at 20°C for 3 hours to separate the proteins.

(3) Detection of protein spot 

[0457] Coomassie staining was performed by the method of Gorg et al. (Electrophoresis, 9: 531-546 (1988)) for the
slab gel after the second dimensional electrophoresis. Specifically, the slab gel was stained under shaking at 25°C for
about 3 hours, the excessive coloration was removed with a decoloring solution, and the gel was thoroughly washed
with distilled water.

[0458] The results are shown in Fig. 2. The proteins derived from the ATCC 13032 strain (Fig. 2A), FERM BP-7134
strain (Fig. 2B) and FERM BP-158 strain (Fig. 2C) could be separated and detected as spots.

(4) In-gel digestion of detected protein spot 

[0459] The detected spots were each cut out from the gel and transferred into a siliconized tube, and 400 µl of a 100
mmol/l ammonium bicarbonate : acetonitrile solution (1:1, v/v) was added thereto, followed by shaking overnight and
freeze-drying as such. To the dried gel, 10 µl of a lysylendopeptidase (LysC) solution (manufactured by WAKO,
prepared with 0.1% SDS-containing 50 mmol/l ammonium bicarbonate to give a concentration of 100 ng/µl) was added,
and the gel was allowed to stand for swelling at 0°C for 45 minutes, and then allowed to stand at 37°C for 16 hours.
After removing the LysC solution, 20 µl of an extracting solution (a mixture of 60% acetonitrile and 5% formic acid) was
added, followed by ultrasonication at room temperature for 5 minutes to disrupt the gel. After the disruption, the extract
was recovered by centrifugation (12,000 rpm, 5 minutes, room temperature). This operation was repeated twice to
recover the whole extract. The recovered extract was concentrated by centrifugation in vacuo to halve the liquid volume.
To the concentrate, 20 µl of 0.1% trifluoroacetic acid was added, followed by thorough stirring, and the mixture was
subjected to desalting using ZipTip (manufactured by Millipore). The peptides adsorbed on the carriers of ZipTip were
eluted with 5 µl of α-cyano-4-hydroxycinnamic acid for use as a sample solution for analysis.

(5) Mass spectrometry and amino acid sequence analysis of protein spot with matrix assisted laser desorption ionization
time of flight mass spectrometer (MALDI-TOFMS)

[0460] The sample solution for analysis was mixed in the equivalent amount with a solution of a peptide mixture for
mass calibration (300 nmol/l Angiotensin II, 300 nmol/l Neurotensin, 150 nmol/l ACTH clip 18-39, 2.3 µmol/l bovine
insulin B chain), and 1 µl of the obtained solution was spotted on a stainless probe and crystallized by spontaneous
drying.

[0461] As measurement instruments, a REFLEX MALDI-TOF mass spectrometer (manufactured by Bruker) and an
N2 laser (337 nm) were used in combination.

[0462] The analysis by PMF (peptide-mass fingerprinting) was carried out using integration spectra data obtained
by measuring 30 times at an acceleration voltage of 19.0 kV and a detector voltage of 1.50 kV under reflector mode
conditions. Mass calibration was carried out by the internal standard method.

[0463] The PSD (post-source decay) analysis was carried out using integration spectra obtained by successively
altering the reflection voltage and the detector voltage at an acceleration voltage of 27.5 kV.

[0464] The masses and amino acid sequences of the peptide fragments derived from the protein spot after digestion
were thus determined.

(6) Identification of protein spot 


[0465] From the amino acid sequence information of the digested peptide fragments derived from the protein spot
obtained in the above (5), ORFs corresponding to the protein were searched on the genome sequence database of
Corynebacterium glutamicum ATCC 13032 as constructed in Example 1 to identify the protein.

[0466] The identification of the protein was carried out using MS-Fit program and MS-Tag program of Intranet protein
prospector.
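
The principle behind matching measured peptide masses to an ORF can be illustrated with a simplified Python sketch. It is not the MS-Fit or MS-Tag algorithm: it only performs an in-silico LysC digestion (cleavage on the C-terminal side of every lysine, no missed cleavages), computes monoisotopic [M+H]+ masses from standard residue masses, and reports observed masses that fall within a tolerance. The protein sequence, the observed masses and the 0.2 Da tolerance are hypothetical.

```python
# Standard monoisotopic residue masses in Da.
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056
PROTON = 1.00728


def lysc_digest(seq):
    """Split a protein sequence after every lysine (K)."""
    peptides, current = [], ""
    for aa in seq:
        current += aa
        if aa == "K":
            peptides.append(current)
            current = ""
    if current:
        peptides.append(current)
    return peptides


def mh_mass(peptide):
    """Monoisotopic mass of the singly protonated peptide, [M+H]+."""
    return sum(MONO[aa] for aa in peptide) + WATER + PROTON


def match(observed, seq, tol=0.2):
    """Pair each observed mass with the digest peptides lying within tol Da."""
    theoretical = {p: mh_mass(p) for p in lysc_digest(seq)}
    return [(m, p) for m in observed
            for p, t in theoretical.items() if abs(m - t) <= tol]


seq = "MAKTGKLLESK"                 # hypothetical ORF translation
observed = [349.19, 589.36, 999.0]  # hypothetical measured masses
print(lysc_digest(seq))
print(match(observed, seq))
```

In practice the same comparison is made against every ORF translation in the genome database, and the ORF explaining the largest number of measured masses is taken as the candidate.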

(a) Search and identification of gene encoding high-expression protein

[0467] Among the proteins derived from Corynebacterium glutamicum ATCC 13032 showing high expression amounts in
the CBB-staining shown in Fig. 2A, the proteins corresponding to Spots-1, 2, 3, 4 and 5 were identified by the above
method.

[0468] As a result, it was found that Spot-1 corresponded to enolase, which was a protein having the amino acid
sequence of SEQ ID NO:4585; Spot-2 corresponded to phosphoglycerate kinase, which was a protein having the amino
acid sequence of SEQ ID NO:5254; Spot-3 corresponded to glyceraldehyde-3-phosphate dehydrogenase, which was
a protein having the amino acid sequence represented by SEQ ID NO:5265; Spot-4 corresponded to fructose
bisphosphate aldolase, which was a protein having the amino acid sequence represented by SEQ ID NO:6543; and
Spot-5 corresponded to triose phosphate isomerase, which was a protein having the amino acid sequence represented
by SEQ ID NO:5252.

[0469] These genes, represented by SEQ ID NOS:1085, 1754, 1775, 3043 and 1752 and encoding the known proteins
corresponding to Spots-1, 2, 3, 4 and 5, respectively, are important in the central metabolic pathway for maintaining
the life of the microorganism. Particularly, it is suggested that the genes of Spots-2, 3 and 5 form an operon and that
a high-expression promoter is encoded upstream thereof (J. of Bacteriol., 174: 6067-6086 (1992)).

[0470] Also, the protein corresponding to Spot-9 in Fig. 2 was identified in the same manner as described above,
and it was found that Spot-9 was an elongation factor Tu, which was a protein having the amino acid sequence
represented by SEQ ID NO:6937, and that the protein was encoded by DNA having the nucleotide sequence represented
by SEQ ID NO:3437.

[0471] Based on these results, the proteins having a high expression level were identified by proteome analysis using
the genome sequence database of Corynebacterium glutamicum constructed in Example 1. Thus, the nucleotide
sequences of the genes encoding the proteins and the nucleotide sequences upstream thereof could be searched
simultaneously. Accordingly, it is shown that nucleotide sequences having a function as a high-expression promoter
can be efficiently selected.



(b) Search and identification of modified protein

[0472] Among the proteins derived from Corynebacterium glutamicum FERM BP-7134 shown in Fig. 2B, Spots-6,
7 and 8 were identified by the above method. As a result, these three spots all corresponded to catalase, which was a
protein having the amino acid sequence represented by SEQ ID NO:3785.

[0473] Accordingly, Spots-6, 7 and 8, detected as spots differing in isoelectric mobility, were all products derived
from a catalase gene having the nucleotide sequence represented by SEQ ID NO:285. This shows that
the catalase derived from Corynebacterium glutamicum FERM BP-7134 was modified after the translation.

[0474] Based on these results, it is confirmed that various modified proteins can be efficiently searched by proteome
analysis using the genome sequence database of Corynebacterium glutamicum constructed in Example 1.

(c) Search and identification of expressed protein effective in lysine production 

[0475] A comparison of Fig. 2A (ATCC 13032: wild type strain), Fig. 2B (FERM BP-7134: lysine-producing
strain) and Fig. 2C (FERM BP-158: lysine-highly producing strain) showed that the catalase corresponding to Spot-8
and the elongation factor Tu corresponding to Spot-9, as identified above, had higher expression levels as the lysine
productivity increased.

[0476] Based on these results, it was found that promising mutated proteins can be efficiently searched and identified
in breeding aiming at strengthening the productivity of a target product by the proteome analysis using the genome
sequence database of Corynebacterium glutamicum constructed in Example 1.

[0477] Moreover, useful mutation points of useful mutants can be easily specified by searching the nucleotide
sequences (nucleotide sequences of promoter, ORF, or the like) relating to the identified proteins using the above
database and using primers designed on the basis of the sequences. Once the mutation points are
specified, industrially useful mutants which have the useful mutations or other useful mutations derived therefrom can
be easily bred.

[0478] While the invention has been described in detail and with reference to specific embodiments thereof, it will
be apparent to one of skill in the art that various changes and modifications can be made therein without departing
from the spirit and scope thereof. All references cited herein are incorporated in their entirety.



Claims 

1. A method for at least one of the following:

(A) identifying a mutation point of a gene derived from a mutant of a coryneform bacterium,

(B) measuring an expression amount of a gene derived from a coryneform bacterium,

(C) analyzing an expression profile of a gene derived from a coryneform bacterium,

(D) analyzing expression patterns of genes derived from a coryneform bacterium, or

(E) identifying a gene homologous to a gene derived from a coryneform bacterium,

said method comprising:

(a) producing a polynucleotide array by adhering to a solid support at least two polynucleotides selected
from the group consisting of first polynucleotides comprising the nucleotide sequence represented by any
one of SEQ ID NOS:1 to 3501, second polynucleotides which hybridize with the first polynucleotides under
stringent conditions, and third polynucleotides comprising a sequence of 10 to 200 continuous bases of
the first or second polynucleotides,

(b) incubating the polynucleotide array with at least one of a labeled polynucleotide derived from a
coryneform bacterium, a labeled polynucleotide derived from a mutant of the coryneform bacterium or a
labeled polynucleotide to be examined, under hybridization conditions,

(c) detecting any hybridization, and

(d) analyzing the result of the hybridization.

2. The method according to claim 1, wherein the coryneform bacterium is a microorganism belonging to the genus
Corynebacterium, the genus Brevibacterium, or the genus Microbacterium.

3. The method according to claim 2, wherein the microorganism belonging to the genus Corynebacterium is selected
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium
acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium
melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

4. The method according to claim 1, wherein the polynucleotide derived from a coryneform bacterium, the
polynucleotide derived from a mutant of the coryneform bacterium or the polynucleotide to be examined is a gene
relating to the biosynthesis of at least one compound selected from an amino acid, a nucleic acid, a vitamin, a
saccharide, an organic acid, and analogues thereof.

5. The method according to claim 1, wherein the polynucleotide to be examined is derived from Escherichia coli.

6. A polynucleotide array, comprising: 

30 

at least two potynucieotides selected from the group consisting of first polynucleotides comprising the nucle- 
otide sequence represented by any one of SEQ ID NOS:1 to 3501 . second polynucleotides which hybridize 
with the first polynucleotides under stringent conditions, and third polynucleotides comprising 10 to 200 con- 
tinuous bases of the first or second polynucleotides, and 
35 a solid support adhered thereto. 

7. A polynucleotide comprising the nucleotide sequence represented by SEQ ID NO:1 or a polynucleotide having a 
homology of at least 80% with the polynucleotide. 

40 8. A polynucleotide comprising any one of the nucleotide sequences represented by SEQ ID NOS:2 to 3431 , or a 
polynucleotide which hybridizes with the polynucleotide under stringent conditions. 

9. A polynucleotide encoding a polypeptide having any one of the amino acid sequences represented by SEQ ID
NOS:3502 to 6931, or a polynucleotide which hybridizes therewith under stringent conditions.

10. A polynucleotide which is present in the 5' upstream or 3' downstream of a polynucleotide comprising the nucleotide
sequence of any one of SEQ ID NOS:2 to 3431 in a whole polynucleotide comprising the nucleotide sequence
represented by SEQ ID NO:1, and has an activity of regulating an expression of the polynucleotide.

11. A polynucleotide comprising 10 to 200 continuous bases in the nucleotide sequence of the polynucleotide of any
one of claims 7 to 10, or a polynucleotide comprising a nucleotide sequence complementary to the polynucleotide
comprising 10 to 200 continuous bases.
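
As an illustrative aside to claim 11, the following minimal Python sketch enumerates contiguous fragments of a chosen length and forms the complementary strand, written 5' to 3' as a reverse complement. The function names `fragments` and `reverse_complement` and the toy sequence are assumptions made for illustration only; they are not taken from the specification or the sequence listing.

```python
from typing import List

# Watson-Crick pairing for the four DNA bases.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written 5' to 3'."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def fragments(seq: str, length: int) -> List[str]:
    """Return every contiguous fragment of exactly `length` bases."""
    if not 10 <= length <= 200:
        raise ValueError("length must be between 10 and 200 bases")
    return [seq[i:i + length] for i in range(len(seq) - length + 1)]


# Toy 12-base sequence, invented for illustration.
toy = "ATGGCGTACGTT"
probes = fragments(toy, 10)
antisense = [reverse_complement(p) for p in probes]
```

Each entry of `antisense` is the strand complementary to the fragment at the same index of `probes`.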

12. A recombinant DNA comprising the polynucleotide of any one of claims 8 to 11.

13. A transformant comprising the polynucleotide of any one of claims 8 to 11 or the recombinant DNA of claim 12.

14. A method for producing a polypeptide, comprising:

culturing the transformant of claim 13 in a medium to produce and accumulate a polypeptide encoded by the
polynucleotide of claim 8 or 9 in the medium, and
recovering the polypeptide from the medium.

15. A method for producing at least one of an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and
analogues thereof, comprising:

culturing the transformant of claim 13 in a medium to produce and accumulate at least one of an amino acid,
a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof in the medium, and
recovering the at least one of the amino acid, the nucleic acid, the vitamin, the saccharide, the organic acid,
and analogues thereof from the medium.

16. A polypeptide encoded by a polynucleotide comprising the nucleotide sequence selected from SEQ ID NOS:2 to 
3431. 



17. A polypeptide comprising the amino acid sequence selected from SEQ ID NOS:3502 to 6931.



18. The polypeptide according to claim 16 or 17, wherein at least one amino acid is deleted, replaced, inserted or
added, said polypeptide having an activity which is substantially the same as that of the polypeptide without said
at least one amino acid deletion, replacement, insertion or addition.

19. A polypeptide comprising an amino acid sequence having a homology of at least 60% with the amino acid sequence
of the polypeptide of claim 16 or 17, and having an activity which is substantially the same as that of the polypeptide.

20. An antibody which recognizes the polypeptide of any one of claims 16 to 19.

21. A polypeptide array, comprising:

at least one polypeptide or partial fragment polypeptide selected from the polypeptides of claims 16 to 19 and
partial fragment polypeptides of the polypeptides, and
a solid support adhered thereto.

22. A polypeptide array, comprising: 

at least one antibody which recognizes a polypeptide or partial fragment polypeptide selected from the polypeptides
of claims 16 to 19 and partial fragment polypeptides of the polypeptides, and
a solid support adhered thereto.

23. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryneform
bacterium, comprising the following:

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:1
to 3501, and target sequence or target structure motif information;

(ii) a data storage device for at least temporarily storing the input information;

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS:
1 to 3501 with the target sequence or target structure motif information, recorded by the data storage device
for screening and analyzing nucleotide sequence information which is coincident with or analogous to the
target sequence or target structure motif information; and

(iv) an output device that shows a screening or analyzing result obtained by the comparator.

24. A method based on a computer for identifying a target sequence or a target structure motif derived from a coryneform
bacterium, comprising the following:

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501, target sequence
information or target structure motif information into a user input device;

(ii) at least temporarily storing said information;

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:1 to 3501 with
the target sequence or target structure motif information; and

(iv) screening and analyzing nucleotide sequence information which is coincident with or analogous to the
target sequence or target structure motif information.
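
The input, storage, comparison and screening steps of claims 23 and 24 can be pictured with a minimal Python sketch. It is only one possible reading: "coincident" is taken here as an exact match and "analogous" as a match within a chosen number of mismatches, which is an assumption and not a definition given in the claims. The names `hamming` and `find_target`, the record identifiers and the toy bases are all hypothetical.

```python
from typing import Dict, List, Tuple


def hamming(a: str, b: str) -> int:
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def find_target(
    stored: Dict[str, str], target: str, max_mismatches: int = 0
) -> List[Tuple[str, int, int]]:
    """Return (record id, 0-based offset, mismatches) for every window of a
    stored sequence that matches `target` within `max_mismatches`."""
    hits = []
    k = len(target)
    for seq_id, seq in stored.items():
        for i in range(len(seq) - k + 1):
            d = hamming(seq[i:i + k], target)
            if d <= max_mismatches:
                hits.append((seq_id, i, d))
    return hits


# Toy records standing in for stored sequence information (invented data).
stored = {"rec_A": "ATGGCGTACGTTAG", "rec_B": "TTGACGTTCGA"}
exact = find_target(stored, "ACGTT")                   # "coincident" hits
near = find_target(stored, "ACGTA", max_mismatches=1)  # "analogous" hits
```

A practical comparator would more likely use an alignment-based search tool; the sliding window above is chosen only because it is short and easy to check by hand.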

25. A system based on a computer for identifying a target sequence or a target structure motif derived from a coryneform
bacterium, comprising the following:

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS:
3502 to 7001, and target sequence or target structure motif information;

(ii) a data storage device for at least temporarily storing the input information;

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS:
3502 to 7001 with the target sequence or target structure motif information, recorded by the data storage
device for screening and analyzing amino acid sequence information which is coincident with or analogous to
the target sequence or target structure motif information; and

(iv) an output device that shows a screening or analyzing result obtained by the comparator.

26. A method based on a computer for identifying a target sequence or a target structure motif derived from a coryneform
bacterium, comprising the following:

(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001, and target
sequence information or target structure motif information into a user input device;

(ii) at least temporarily storing said information;

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001
with the target sequence or target structure motif information; and

(iv) screening and analyzing amino acid sequence information which is coincident with or analogous to the
target sequence or target structure motif information.

27. A system based on a computer for determining a function of a polypeptide encoded by a polynucleotide having a
target nucleotide sequence derived from a coryneform bacterium, comprising the following:

(i) a user input device that inputs at least one nucleotide sequence information selected from SEQ ID NOS:2
to 3501, function information of a polypeptide encoded by the nucleotide sequence, and target nucleotide
sequence information;

(ii) a data storage device for at least temporarily storing the input information;

(iii) a comparator that compares the at least one nucleotide sequence information selected from SEQ ID NOS:
2 to 3501 with the target nucleotide sequence information for determining a function of a polypeptide encoded
by a polynucleotide having the target nucleotide sequence which is coincident with or analogous to the polynucleotide
having at least one nucleotide sequence selected from SEQ ID NOS:2 to 3501; and

(iv) an output device that shows a function obtained by the comparator.

28. A method based on a computer for determining a function of a polypeptide encoded by
a polynucleotide having a target nucleotide sequence derived from a coryneform bacterium, comprising the following:

(i) inputting at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501, function information
of a polypeptide encoded by the nucleotide sequence, and target nucleotide sequence information;

(ii) at least temporarily storing said information;

(iii) comparing the at least one nucleotide sequence information selected from SEQ ID NOS:2 to 3501 with
the target nucleotide sequence information; and

(iv) determining a function of a polypeptide encoded by a polynucleotide having the target nucleotide sequence
which is coincident with or analogous to the polynucleotide having at least one nucleotide sequence selected
from SEQ ID NOS:2 to 3501.
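
A minimal Python sketch of the function-determination idea in claims 27 and 28 follows: stored sequences carry function information, the target is compared with each, and the function of the most similar stored sequence is reported when the similarity clears a threshold. The k-mer Jaccard score, the 0.5 threshold, the names `kmers`, `similarity` and `infer_function`, and the toy annotations are all assumptions chosen for brevity; the claims do not specify a similarity measure.

```python
from typing import Dict, Optional, Set, Tuple


def kmers(seq: str, k: int = 3) -> Set[str]:
    """Set of all overlapping substrings of length k."""
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def similarity(a: str, b: str, k: int = 3) -> float:
    """Jaccard index of the k-mer sets of two sequences (0.0 to 1.0)."""
    ka, kb = kmers(a, k), kmers(b, k)
    if not ka or not kb:
        return 0.0
    return len(ka & kb) / len(ka | kb)


def infer_function(
    target: str,
    annotated: Dict[str, Tuple[str, str]],
    threshold: float = 0.5,
) -> Optional[str]:
    """Return the function of the best-scoring stored sequence, or None
    when no stored sequence reaches `threshold`."""
    best_score, best_function = 0.0, None
    for seq, function in annotated.values():
        score = similarity(target, seq)
        if score > best_score:
            best_score, best_function = score, function
    return best_function if best_score >= threshold else None


# Invented records: id -> (sequence, function information).
annotated = {
    "orf_1": ("ATGGCGTACGTT", "hypothetical kinase"),
    "orf_2": ("ATGTTTCCCGGG", "hypothetical transporter"),
}
result = infer_function("ATGGCGTACGTA", annotated)
```

Returning `None` below the threshold keeps the sketch from assigning a function to a target that resembles nothing in the stored set.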

29. A system based on a computer for determining a function of a polypeptide having a target amino acid sequence
derived from a coryneform bacterium, comprising the following:

(i) a user input device that inputs at least one amino acid sequence information selected from SEQ ID NOS:
3502 to 7001, function information based on the amino acid sequence, and target amino acid sequence information;

(ii) a data storing device for at least temporarily storing the input information;

(iii) a comparator that compares the at least one amino acid sequence information selected from SEQ ID NOS:
3502 to 7001 with the target amino acid sequence information for determining a function of a polypeptide
having the target amino acid sequence which is coincident with or analogous to the polypeptide having at least
one amino acid sequence selected from SEQ ID NOS:3502 to 7001; and

(iv) an output device that shows a function obtained by the comparator.

30. A method based on a computer for determining a function of a polypeptide having a target amino acid sequence 
derived from a coryneform bacterium, comprising the following: 



(i) inputting at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001, function
information based on the amino acid sequence, and target amino acid sequence information;

(ii) at least temporarily storing said information;

(iii) comparing the at least one amino acid sequence information selected from SEQ ID NOS:3502 to 7001
with the target amino acid sequence information; and

(iv) determining a function of a polypeptide having the target amino acid sequence which is coincident with or
analogous to the polypeptide having at least one amino acid sequence selected from SEQ ID NOS:3502 to
7001.

31. The system according to any one of claims 23, 25, 27 and 29, wherein a coryneform bacterium is a microorganism
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium.

32. The method according to any one of claims 24, 26, 28 and 30, wherein a coryneform bacterium is a microorganism
of the genus Corynebacterium, the genus Brevibacterium, or the genus Microbacterium.



33. The system according to claim 31, wherein the microorganism belonging to the genus Corynebacterium is selected
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium
acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium
melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

34. The method according to claim 32, wherein the microorganism belonging to the genus Corynebacterium is selected
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium
acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium
melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

35. A recording medium or storage device which is readable by a computer in which at least one nucleotide sequence
information selected from SEQ ID NOS:1 to 3501 or function information based on the nucleotide sequence is
recorded, and is usable in the system of claim 23 or 27 or the method of claim 24 or 28.



36. A recording medium or storage device which is readable by a computer in which at least one amino acid sequence
information selected from SEQ ID NOS:3502 to 7001 or function information based on the amino acid sequence
is recorded, and is usable in the system of claim 25 or 29 or the method of claim 26 or 30.

37. The recording medium or storage device according to claim 35 or 36, which is a computer readable recording
medium selected from the group consisting of a floppy disc, a hard disc, a magnetic tape, a random access memory
(RAM), a read only memory (ROM), a magneto-optic disc (MO), CD-ROM, CD-R, CD-RW, DVD-ROM, DVD-RAM
and DVD-RW.

38. A polypeptide having a homoserine dehydrogenase activity, comprising an amino acid sequence in which the Val
residue at the 59th position in the amino acid sequence of homoserine dehydrogenase derived from a coryneform bacterium
is replaced with an amino acid residue other than a Val residue.

39. A polypeptide comprising an amino acid sequence in which the Val residue at the 59th position in the amino acid
sequence as represented by SEQ ID NO:6952 is replaced with an amino acid residue other than a Val residue.






40. The polypeptide according to claim 38 or 39, wherein the Val residue at the 59th position is replaced with an Ala
residue.

41. A polypeptide having pyruvate carboxylase activity, comprising an amino acid sequence in which the Pro residue
at the 458th position in the amino acid sequence of pyruvate carboxylase derived from a coryneform bacterium is
replaced with an amino acid residue other than a Pro residue.

42. A polypeptide comprising an amino acid sequence in which the Pro residue at the 458th position in the amino acid
sequence represented by SEQ ID NO:4265 is replaced with an amino acid residue other than a Pro residue.

43. The polypeptide according to claim 41 or 42, wherein the Pro residue at the 458th position is replaced with a Ser
residue.
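
The single-residue replacements recited in claims 38 to 43 (Val to Ala, Pro to Ser) can be illustrated in silico with the minimal Python sketch below. The helper name `substitute` and the six-residue toy peptide are hypothetical; the positions used are toy positions, not positions 59 or 458 of the sequences in the sequence listing.

```python
def substitute(seq: str, position: int, expected: str, replacement: str) -> str:
    """Replace the residue at a 1-based position, after checking that the
    original residue is the one expected."""
    if not 1 <= position <= len(seq):
        raise ValueError("position outside the sequence")
    found = seq[position - 1]
    if found != expected:
        raise ValueError(f"expected {expected} at position {position}, found {found}")
    return seq[:position - 1] + replacement + seq[position:]


# Toy peptide (invented). One-letter codes: V = Val, A = Ala, P = Pro, S = Ser.
toy = "MKVLPG"
val_to_ala = substitute(toy, 3, "V", "A")
pro_to_ser = substitute(toy, 5, "P", "S")
```

Checking the expected residue before replacing it guards against an off-by-one error in the position, which is easy to make when switching between 0-based and 1-based numbering.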

44. The polypeptide according to any one of claims 38 to 43, which is derived from Corynebacterium glutamicum.

45. A DNA encoding the polypeptide of any one of claims 38 to 44.

46. A recombinant DNA comprising the DNA of claim 45.

47. A transformant comprising the recombinant DNA of claim 46.

48. A transformant comprising in its chromosome the DNA of claim 46.

49. The transformant according to claim 47 or 48, which is derived from a coryneform bacterium.

50. The transformant according to claim 49, which is derived from Corynebacterium glutamicum.

51. A method for producing L-lysine, comprising:

culturing the transformant of any one of claims 47 to 50 in a medium to produce and accumulate L-lysine in
the medium, and

recovering the L-lysine from the culture.

52. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ
ID NOS:1 to 3431, comprising the following:

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived from a coryneform bacterium
which has been subjected to mutation breeding so as to produce at least one compound selected from
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof by a fermentation
method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431;

(ii) identifying a mutation point present in the production strain based on a result obtained by (i);

(iii) introducing the mutation point into a coryneform bacterium which is free of the mutation point, or deleting
the mutation point from a coryneform bacterium having the mutation point; and

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform
bacterium obtained in (iii).
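
Steps (i) to (iii) of claim 52 have a simple in-silico counterpart, sketched below in Python: compare a production-strain sequence with the reference, list the positions that differ, and write those differences into a sequence that lacks them. The sketch assumes the two sequences are already aligned and of equal length, which is a simplification; the names `mutation_points` and `introduce` and the toy sequences are hypothetical. Step (iv), the fermentation test, is laboratory work and is not modelled.

```python
from typing import List, Tuple

# A mutation point: (1-based position, reference base, production-strain base).
Point = Tuple[int, str, str]


def mutation_points(reference: str, production: str) -> List[Point]:
    """List the positions at which the production strain differs from the reference."""
    if len(reference) != len(production):
        raise ValueError("this sketch assumes pre-aligned sequences of equal length")
    return [
        (i, r, p)
        for i, (r, p) in enumerate(zip(reference, production), start=1)
        if r != p
    ]


def introduce(sequence: str, points: List[Point]) -> str:
    """Write the given mutation points into a sequence that is free of them."""
    bases = list(sequence)
    for pos, ref, mut in points:
        if bases[pos - 1] != ref:
            raise ValueError(f"position {pos} does not carry the reference base {ref}")
        bases[pos - 1] = mut
    return "".join(bases)


# Toy sequences (invented): a single base differs at position 5.
reference = "ATGGTTCCG"
production = "ATGGCTCCG"
points = mutation_points(reference, production)
rebuilt = introduce(reference, points)
```

In this toy example the change turns the second codon from GTT (Val) into GCT (Ala) under the standard genetic code, the same kind of substitution as in claim 40. Deleting a mutation point, the other branch of step (iii), would amount to calling `introduce` with the reference and production bases swapped.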

53. The method according to claim 52, wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or
a signal transmission pathway.

54. The method according to claim 52, wherein the mutation point is a mutation point relating to a useful mutation
which improves or stabilizes the productivity.

55. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ
ID NOS:1 to 3431, comprising:

(i) comparing a nucleotide sequence of a genome or gene of a production strain derived from a coryneform bacterium
which has been subjected to mutation breeding so as to produce at least one compound selected from
an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof by a fermentation
method, with a corresponding nucleotide sequence in SEQ ID NOS:1 to 3431;

(ii) identifying a mutation point present in the production strain based on a result obtained by (i);

(iii) deleting a mutation point from a coryneform bacterium having the mutation point; and

(iv) examining productivity by the fermentation method of the compound selected in (i) of the coryneform
bacterium obtained in (iii).

56. The method according to claim 55, wherein the gene is a gene encoding an enzyme in a biosynthetic pathway or
a signal transmission pathway.

57. The method according to claim 55, wherein the mutation point is a mutation point which decreases or destabilizes
the productivity.

58. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ
ID NOS:2 to 3431, comprising the following:

(i) identifying an isozyme relating to biosynthesis of at least one compound selected from an amino acid, a
nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, based on the nucleotide sequence
information represented by SEQ ID NOS:2 to 3431;

(ii) classifying the isozyme identified in (i) into an isozyme having the same activity;

(iii) mutating all genes encoding the isozyme having the same activity simultaneously; and

(iv) examining productivity by a fermentation method of the compound selected in (i) of the coryneform bacterium
which has been transformed with the gene obtained in (iii).

59. A method for breeding a coryneform bacterium using the nucleotide sequence information represented by SEQ
ID NOS:2 to 3431, comprising the following:

(i) arranging function information of an open reading frame (ORF) represented by SEQ ID NOS:2 to 3431;

(ii) allowing the arranged ORF to correspond to an enzyme on a known biosynthesis or signal transmission
pathway;

(iii) explicating an unknown biosynthesis pathway or signal transmission pathway of a coryneform bacterium
in combination with information relating to a known biosynthesis pathway or signal transmission pathway of a
coryneform bacterium;

(iv) comparing the pathway explicated in (iii) with a biosynthesis pathway of a target useful product; and

(v) transgenetically varying a coryneform bacterium based on the nucleotide sequence information to either
strengthen a pathway which is judged to be important in the biosynthesis of the target useful product in (iv) or
weaken a pathway which is judged not to be important in the biosynthesis of the target useful product in (iv).

60. A coryneform bacterium, bred by the method of any one of claims 52 to 59.

61. The coryneform bacterium according to claim 60, which is a microorganism belonging to the genus Corynebacterium,
the genus Brevibacterium, or the genus Microbacterium.

62. The coryneform bacterium according to claim 61, wherein the microorganism belonging to the genus Corynebacterium
is selected from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum,
Corynebacterium acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium,
Corynebacterium melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

63. A method for producing at least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide,
an organic acid and an analogue thereof, comprising:

culturing a coryneform bacterium of any one of claims 60 to 62 in a medium to produce and accumulate at
least one compound selected from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid,
and analogues thereof; and
recovering the compound from the culture.

64. The method according to claim 63, wherein the compound is L-lysine.

65. A method for identifying a protein relating to useful mutation based on proteome analysis, comprising the following:

(i) preparing
a protein derived from a bacterium of a production strain of a coryneform bacterium which has been subjected
to mutation breeding by a fermentation process so as to produce at least one compound selected
from an amino acid, a nucleic acid, a vitamin, a saccharide, an organic acid, and analogues thereof, and
a protein derived from a bacterium of a parent strain of the production strain;

(ii) separating the proteins prepared in (i) by two dimensional electrophoresis;

(iii) detecting the separated proteins, and comparing an expression amount of the protein derived from the
production strain with that derived from the parent strain;

(iv) treating the protein showing different expression amounts as a result of the comparison with a peptidase
to extract peptide fragments;

(v) analyzing amino acid sequences of the peptide fragments obtained in (iv); and

(vi) comparing the amino acid sequences obtained in (v) with the amino acid sequence represented by SEQ
ID NOS:3502 to 7001 to identify the protein having the amino acid sequences.
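
Step (vi) of claim 65 is in essence a lookup of sequenced peptide fragments against stored protein sequences. The minimal Python sketch below treats a protein as identified when every fragment occurs in it as an exact substring; that matching rule, the name `identify_proteins`, and the toy sequences are assumptions for illustration and are not taken from the specification.

```python
from typing import Dict, Iterable, List


def identify_proteins(peptides: Iterable[str], proteins: Dict[str, str]) -> List[str]:
    """Return identifiers of proteins whose sequence contains every peptide fragment."""
    peptide_list = list(peptides)
    return [
        protein_id
        for protein_id, seq in proteins.items()
        if all(p in seq for p in peptide_list)
    ]


# Invented protein records and peptide fragments (one-letter amino acid codes).
proteins = {"prot_1": "MKVLPGAATRSE", "prot_2": "MSTTRLLKVAAG"}
hits = identify_proteins(["VLPG", "TRSE"], proteins)
```

Real fragment data usually contain sequencing ambiguities, so a tolerant comparison would probably be needed in practice; exact substring matching is used here only to keep the example short.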

66. The method according to claim 65, wherein the coryneform bacterium is a microorganism belonging to the genus
Corynebacterium, the genus Brevibacterium, or the genus Microbacterium.

67. The method according to claim 66, wherein the microorganism belonging to the genus Corynebacterium is selected
from the group consisting of Corynebacterium glutamicum, Corynebacterium acetoacidophilum, Corynebacterium
acetoglutamicum, Corynebacterium callunae, Corynebacterium herculis, Corynebacterium lilium, Corynebacterium
melassecola, Corynebacterium thermoaminogenes, and Corynebacterium ammoniagenes.

68. A biologically pure culture of Corynebacterium glutamicum AHP-3 (FERM BP-7382).